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DESCRIPTION 
PROMOTER FOR SMOOTH MUSCLE CELL EXPRESSION 

BACKGROUND OF THE INVENTION 

The government owns rights in the present invention pursuant to grant numbers R01- 
HL48257, U01 AI34566 and R01HL51I45 from the Public Health Service. 

The present invention relates generally to the Fields of gene expression, particularly 
tissue specific expression, and more particularly smooth muscle cell specific expression. The 
invention also relates to cell proliferation diseases such as atherosclerosis, restenosis 
following balloon angioplasty and airway blockage in asthma. 

The phenotypic plasticity of smooth muscle cells (SMCs) permits this muscle cell 
lineage to subserve diverse functions in multiple tissues including the arterial wall, uterus, 
respiratory, urinary and digestive tracts. In contrast to fast and slow skeletal muscle cells 
which fuse and terminally differentiate before expressing contractile protein isoforms, SMCs 
are capable of simultaneously proliferating and expressing a set of lineage-restricted proteins 
including myofibrillar isoforms, cell surface receptors and SMC-restricted enzymes. 
Moreover, in response to specific physiological and pathophysiological stimuli, SMCs can 
modulate their phenotype by down-regulating a set of contractile protein genes, and in so 
doing, convert from the so called "contractile phenotype" to a de-differentiated "secretory 
phenotype" (Mosse et al. Lab Invest., 53:556-562, 1985; Owens et al, J. Cell Biol., 
102:343-352, 1986; Rovner et al, J. Biol Chem., 261:14740-14745, 1986; Taubman et al . ./ 
Cell Biol, 104:1505-1513, 1987; Ueki et al, Proc. Natl Acad. Sci. USA, 84:9049-9053, 1987; 
Belkin et al. 9 J. Biol Chem.. 263:6631-6635, 1988; Glukhova et al, Proc. Natl Acad. Sci. 
USA., 85:9542-9546, 1988: Chaponnier et al, Eur. J. Biochem., 190:559-565, 1990; Gimona et 
al, FEBS Letters, 274:159-162, 1990; Shanahan et al, Circ. Res., 73:193-204, 1993). 

This phenotypic modulation has been implicated in the pathogenesis of a number of 
disease states including atherosclerosis and restenosis following coronary balloon angioplasty 
(Ross, N. Engl. J. Med. 314:488-500, 1986; Schwartz et al. Circ. Res., 58:427-440. 1986; 
Zanellato et al, Arteriosclerosis, 1 0:996-1009, 1990; Ross. Am. J. Pathol, 43:987-1002. 1993: 
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Olson and Klein, Genes Dev., 8:1-8, 1994) and may also contribute to the airway remodeling 
seen in asthma (James et al. Am. Rev. Respir. Dis\, 139:242-246, 1989). Restenosis 
following coronary balloon angioplasty is a major problem, and contributes to the 40% failure 
rate of this procedure (Schwartz et al, 1992; Liu et al, Circ. 79:1374-1387, 1989). 
Restenosis occurs because the smooth muscle cells are stimulated to proliferate after 
angioplasty and thus block the arterial wall. Because of restenosis, balloon angioplasty is 
used mainly for palliation in patients who are not acceptable candidates for open heart 
surgery (Scientific American Medicine, Rubenstein and Federman, Eds., March 1993, Section 
1, XII, page 11). A method is needed, therefore, to control or inhibit the proliferation of 
smooth muscle cells after angioplasty. 

Although RDAd efficiently transduce both resting and proliferating SMCs in vivo, a 
potential limitation of their use in the clinical setting is their capacity to infect and program 
transgene expression in many different cell lineages and tissues (Ohno etigl, Science, 265 
(5173):781-784, 1994; Haddada et at., Current Topics in Microbiology & Immunology 199 
(Pt 3):297-306, 1995). For example, localized arterial administration of RDAd results in 
efficient infection of endothelial cells, vascular SMCs and adventitial cells (French et al. 
Circulation, 90 (5):2402-2413, 1994; Simari et al., J. Clin. Invest., 98 (l):225-235 5 1996). 
Moreover, intravenous administration of these vectors results in high-level gene transfer to 
the liver and lung (Kashyap et al., J. Clin. Invest. 96 (3): 1612-1620, 1995: Johns et al. 
J. Clin. Invest. 96 (2): 1 152-1 1 58, 1995; Miller and Vile, FASEB Journal, 9 (2):190-199. 
1995). Several approaches have been used in an attempt to circumvent this problem. First, it 
has been possible to restrict the expression of a viral transgene to a specific cell or tissue by- 
administering the virus ex vivo. However, this approach is laborious and is not practicaLfor 
the treatment of most vascular proliferative disorders. A second approach has involved 
delivery of adenoviral particles locally within the vasculature (to the site of vessel wall 
injury) or within a tissue (Ohno et al., 1994; Chang et al. Science, 267:518-522. 1995a: 
Guzman et al, 1994; Chang et al, Mol Medicine, 1:172-181, 1995b). Specially-modified 
catheter delivery systems including coated-balloons and intravascular stents have been 
designed in order to achieve high local concentrations of adenovirus within the vasculature 
(March et al, Human Gene Therapy, 6 (l):41-53. 1995: Rajasubramanian et al. ASAIO 
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Journal. 40 (3):M584-M589, 1994; Kito et al, ASA lO Journal 40 (3):M260-M266, 1994). 
However, the usefulness of these approaches may be limited within the human coronary 
circulation due to the high frequency of side branches. Moreover, such catheter delivery 
systems do not restrict transgene expression to specific cell types in the vessel wall. Finally, 
several groups have reported that the tissue-tropism of RDAd can be modified by 
electrostatically conjugating adenoviral proteins to ligands that can bind specifically to tissue- 
specific cell-surface receptors (Krasnykh et ai, Journal of Virology, 70 ( 10):6839-6846, 
1996). This approach has been used to successfully target RDAd to hepatocytes and 
hematopoietic progenitor cell lines (Schwarzenberger et al.. Blood, 87 (2):472-478, 1996). 

The use of tissue-specific transcriptional regulatory elements represents an alternative 
strategy to restrict adenoviral transgene expression to specific cell lineages or tissues in vivo 
(Miller and Vile, 1995). While theoretically appealing, this strategy is potentially limited 
because the adenovirus genome contains multiple highly active transcriptional enhancers that 
are capable of transactivating a variety of different promoters in multiple cell lineages 
(Haddada et al. 9 1995). Such a targeting strategy is particularly problematic in smooth 
muscle cells because of the lack of smooth muscle cell-specific transcriptional regulatory 
elements that function in vivo. Thus, there is still a need for discovery of a smooth muscle cell 
specific promoter that is not expressed in other types of cells and is constitutively expressed in 
both quiescent and proliferating cells and that maintains its tissue specificity when administered 
to an animal. 

SUMMARY OF THE INVENTION 

The present invention seeks to overcome these and other drawbacks in the prior art by 
providing a promoter that is capable of expression of a heterologous gene in a tissue specific 
manner, in particular, smooth muscle cells, and by offering the further advantage that the 
control of expression directed by the promoter is constitutive and cell cycle independent. The 
promoter of the present invention thus promotes transcription in both resting and proliferating 
cells, in contrast to other known smooth muscle cell promoters that are down-regulated in 
proliferating cells. This promoter may be used therefore, to express heterologous proteins or 
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mRNAs in proliferating smooth muscle cells and to control proliferative diseases or to 
promote angiogenesis, for example. 

The invention may be described, in certain embodiments, as an isolated nucleic acid 
segment comprising an SM22ot promoter sequence operatively linked to a heterologous gene 
and capable of directing expression of that gene. The isolated SM22ct promoter may be 
described as the region immediately upstream of the transcriptional start site of the murine 
SM22a gene. As described herein a nucleic acid segment having a sequence according to 
bases 899-1382 of SEQ ID NO:l, is also effective to promote transcription in a smooth 
muscle cell and a nucleic acid segment having that sequence or the transcriptional control 
elements of that sequence would also fall within the scope of the claimed invention. Such 
homologous promoters may be isolated from an animal sequence, such as from a mouse, pig, 
rat, hamster, rabbit or even a human genome or cDNA library using any of the sequences 
disclosed herein as a molecular probe. In addition, based on the present disclosure, one of 
skill might construct such a promoter by splicing elements taken from various sources 
including, but not limited to, chemically synthesized nucleic acid molecules, or elements 
removed from other naturally occurring promoters, or from the SM22a promoter. It is 
understood that any such promoter, or a promoter having the essential elements of the 
promoter disclosed herein and useful to express a heterolgous nucleic acid sequence would be 
encompassed by the spirit and scope of the invention claimed herein. 

The promoter region of the present invention may be defined as comprising that 
region of the genome immediately upstream (5') of the structural SM22a gene, and 
controlling expression of that gene. For example, the promoter may comprise the region of 
up to 30, 40, 50, 100, 500, 1,000, 1,500, 2,000 or even up to 5,000 bases directly upstream of 
the transcriptional start site of the SM22ct gene, and more specifically, an SM22ct promoter 
of the present invention may be described as an isolated nucleic acid segment that comprises 
a contiguous sequence of bases 1-1381 (-1338 to +41) of SEQ ID NO:l. The designations of 
-1 338 to +41 and the like indicate the position of a base relative to the transcriptional start site 
(+1), which, in the murine genome, is disclosed herein to be base 1341 of SEQ ID NO:l. The 
promoter of the present invention may also be described as an isolated nucleic acid segment 
that comprises a contiguous sequence of bases 899-1381 (-441 to +41) of SEQ ID NO:l. 
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Certain elements of the promoter that are identified in light of the present disclosure are a 
TATA box 29-bp 5* of the start site, five consensus E boxes/bHLH myogenic transcription 
factor binding sites located at bps -534, -577, -865, -898, -910, and -1267, three consensus 
GATA-4 binding sites located at bps -504, -828, -976, two AT-rich, potential MEF-2/rSRF 
binding sites located at bps -407 and -770 and at least one c/s-acting, positive transcriptional 
regulatory element contained by bp -435 to -416. In addition, the promoter of the present 
invention contains consensus CArG/SRF binding sites located at bps -150 and -273 and one 
CACC box located at bp -104. 

Thus, the promoter of the present invention may comprise some or all of the elements 
described in the previous paragraph. Such elements may be isolated and recombined by 
techniques well known in the art to produce a smooth muscle cell specific promoter that may 
be smaller than the 441 to 482 bases disclosed herein as a minimal sequence required for 
constitutive smooth muscle cell transcription. It is also known that certain stretches of 
sequence in the promoter are required for spacing of the cis acting elements and that any 
sequence that does not impart hairpin loops or other deleterious structural properties may be 
substituted for those regions so long as the spacing and conformation remains the same. It is 
understood that all such promoters would be encompassed by the present invention. 

The isolated nucleic acid segments of the present invention may also be defined as 
comprising a nucleic acid sequence or even a gene operatively linked to an isolated SM22a 
promoter sequence. Operatively linked is understood to mean that the gene is joined to the 
promoter region such that the promoter is oriented 5' to the gene and is of an appropriate 
distance from the transcription start site, so that the transcription of the gene will be 
dependent on or controlled by the promoter sequence. The arts of restriction enzyme 
digestion and nucleic acid ligation to be used in construction of a promoter-gene construct are 
well known in the art as exemplified by Maniatis et aL, Molecular Cloning, A Laboratory 
Manual. Cold Spring Harbor, New York, 1982, (incorporated herein by reference). Therefore 
one would, using standard techniques, prepare a gene by restriction enzyme digestion to have 
a compatible end sequence, or even a blunt end, to be ligated downstream of the SM22a 
promoter. The restriction enzyme recognition site may be a naturally occurring sequence, or 
a sequence generated by site directed mutagenesis, by a PCR™ primer sequence or by any 
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other means known in the art. Alternatively, one might chemically synthesize a gene or gene 
fragment or an oligonucleotide containing an appropriate restriction enzyme recognition 
sequence or one might prepare a gene by any of several methods known in the art. 

The gene or nucleic acid segment may be, for example, a structural gene that encodes 
a full length protein, a portion or part of a protein, or a peptide that one desires to express in a 
smooth muscle cell. The gene may also encode an RNA sequence, such as an antisense 
oligonucleotide sequence, or even a regulatory sequence that affects the expression of another 
gene or genes. In certain preferred embodiments of the invention, the gene will be a cell 
cycle control gene, such as a retinoblastoma (Rb) gene, a phosphorylation deficient Rb gene, 
p53, p21, pi 6, p27, a cell cycle dependent kinase inhibitor, E2F inhibitor, a CDK kinase or a 
cyclin gene; alternatively the gene will be an angiogenesis gene such as VEGF, iNOS, eNOS. 
basic FGF or FGF-5, or the gene may be a cytotoxic gene such as a herpes simplex thymidine 
kinase gene, or any other gene, the expression of which will affect proliferation of the smooth 
muscle cells in which the gene is expressed, and/or endothelial cells in such a ways as to 
effect the growth of new blood vessels. Alternatively, the nucleic acid segment may encode 
an antisense RNA effective to inhibit expression of a cell cycle controi gene or regulatory 
element. Antisense constructs are oligo- or polynucleotides comprising complementary 
nucleotides to the control regions or coding segments of a DNA molecule, such as a gene or 
cDNA. Such constructs may include antisense versions of both the promoter and other 
control regions, exons, introns and exon:intron boundaries of a gene. Antisense molecules 
are designed to inhibit the transcription, translation or both, of a given gene or construct, such 
that the levels of the resultant protein product are reduced or diminished. Antisense RNA 
constructs, or DNA encoding such antisense RNAs, may be employed to inhibit gene 
transcription or translation or both within a host cell, either in vitro or in vivo, such as within 
a host animal, including a human subject. Of course, the antisense constructs have evident 
utility in the types of nucleic acid hybridization described herein. The gene may also encode 
an antigenic sequence and the necessary leader sequence for transport to the cell surface, or it 
may encode an enzyme, or an intracellular signal protein or peptide, or it may even encode an 
SM22a gene or SM22a cDNA gene. Particularly preferred is a constitutively active form of 
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the Rb gene product that inhibits cellular proliferation, disclosed in Chang et al., 1995a 
(incorporated herein by reference). 

The present invention may also be described, in certain embodiments, as a 
recombinant vector that is capable of replication in an appropriate host cell and that 
comprises an SM22ct promoter sequence as disclosed herein, including an SM22cc promoter 
operatively linked to a gene or nucleic acid segment. Preferred vectors include, but are not 
limited to. a plasmid, a raus sarcoma virus (RSV) vector, a p21 viral vector, an adeno- 
associated viral vector or an adenoviral vector. In addition, a variety of viral vectors, such as 
retroviral vectors, herpes simplex virus (U.S. Patent 5,288,641, incorporated herein by 
reference), cytomegalovirus, and the like may be employed, as described by Miller 
(Microbiol. Immunol., 158:1, 1992, incorporated herein by reference). Recombinant adeno- 
associated virus (AAV) and AAV vectors may also be employed, such as those described in 
U.S. Patent 5,139,941, incorporated herein by reference. Recombinant adenoviral vectors are 
currently preferred. Techniques for preparing replication-defective infective viruses are well 
known in the art, as exemplified by Ghosh-Choudhury & Graham, Biochem. Biophys. Res. 
Comm.. 147:964-973 (1987); McGrory et al., Virology, 163:614-617,(1988); and Gluzman et 
al, In: Eukaryotic Viral Vectors (Gluzman, Y., Ed.) pp. 187-192, Cold Spring Harbor Press, 
Cold Spring Harbor, New York, (1982), each incorporated herein by reference. Also preferred 
are plasmid vectors designed for increased expression such as those described in Tripathy et 
al, Proc. Natl. Acad. Sci. USA, 93:10876-80, 1996. 

A preferred adenovirus used in the practice of the present invention is replication- 
defective and particularly those that lack the early gene region El or the early gene regions 
El and E3. For example, the foreign DNA of interest, such as the smooth muscle specific 
transcriptional regulatory sequence of the present invention may be inserted into the region of 
the deleted El and E3 regions of the adenoviral genome. In this way. the entire sequence is 
capable of being packaged into virions that can transfer the foreign DNA into an infectable 
host cell. A preferred adenovirus is a type 5 adenovirus and a SM22a promoter and coding 
sequence are preferably flanked by adenovirus type 5 sequences. 

The invention may be described in certain embodiments as a replication deficient 
adenoviral vector, wherein the vector comprises a smooth muscle cell specific transcriptional 
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regulatory segment. Preferred smooth muscle cell specific transcriptional regulatory- 
segments include, but are not limited to an SM22a promoter, a smooth muscle calponin 
promoter, a smooth muscle myosin heavy chain promoter, a smooth muscle alpha actin 
promoter, a smooth muscle alpha actin enhancer, a telokin promoter, a smooth muscle 
gamma-actin promoter or a smooth muscle gamma-actin enhancer. In addition, the enhancer 
elements may be included in an adenoviral vector of the present invention in combination 
with a promoter segment. A smooth muscle cell specific transcriptional regulatory segment 
of the present invention may be isolated from any source such as a mammal or a bird, for 
example, and including but not limited to a mouse, pig, rat, rabbit, human or chicken. Specific 
examples of such segments would include a mouse smooth muscle calponin promoter (bases 1 - 
1216 of Genbank accession #U38929, or the promoter sequence of Genbank accession 
#U3707L or bases 1-631 of Genbank accession #L49022); a smooth muscle myosin heavy 
chain promoter from a mouse (bases 1-1536 of Genbank accession # U53469), a rat (bases 1- 
1699 of Genbank accession #U55179, or bases 1-2425 of Genbank accession #U83321), or a 
rabbit (bases 1-2267 of Genbank accession #U 155 14); a human smooth muscle alpha actin 
promoter (bases 1-1 746 of Genbank accession #D0061 8, or bases 3 -892 of Genbank accession 
#J05193) or a human smooth muscle alpha actin enhancer (bases 1789-5559 of Genbank 
accession #D006 1 8); a chicken smooth muscle alpha actin promoter (bases I - 1 0 1 3 of Genbank 
accession #M 1 3756, D0004 1 , N00041 ), a mouse smooth muscle alpha actin promoter (bases 1 - 
1074 of Genbank accession #M57409, M35194), a rat smooth muscle alpha actin promoter 
(Genbank accession #S7601 1); a rabbit telokin promoter (bases 1-3460 of Genbank accession 
#U407 1 2); a mouse smooth muscle gamma-actin promoter (bases 1 - 1 076 of Genbank accession 
#U 19488) or a mouse smooth muscle gamma-actin enhancer (bases 1 123-5703 of Genbank 
accession #U 19488). (All sequences discussed in this paragraph incorporated herein by 
reference). 

The smooth muscle cell specific regulatory element or segment may be operatively 
linked to a gene or nucleic acid segment as defined above, i.e. it may be a cell cycle control 
gene, such as a retinoblastoma (Rb) gene, a phosphorylation deficient Rb gene, p53, p21, p16 ; 
p27, a cell cycle dependent kinase inhibitor, E2F inhibitor a CDK kinase or a cyclin gene; 
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alternatively the gene will be an angiogenesis gene such as VEGF, iNOS, eNOS, basic FGF or 
FGF-5, or the gene may be a cytotoxic gene such as a herpes simplex thymidine kinase gene. 

In certain embodiments of the invention, the vector of the present invention is 
dispersed in a pharmaceutical ly or pharmacologically acceptable solution. Preferred solutions 
include neutral saline solutions buffered with phosphate, lactate, Tris, and the like. Of 
course, one will desire to purify the vector sufficiently to render it essentially free of 
undesirable contaminant, such as defective interfering adenovirus particles or endotoxins and 
other pyrogens such that it will not cause any untoward reactions in the individual receiving 
the vector construct. A preferred means of purifying the vector involves the use of buoyant 
density gradients, such as cesium chloride gradient centrifiigation. 

The present invention may also be described, in certain embodiments, as a method of 
expressing a gene in a smooth muscle cell comprising the steps of: obtaining an isolated 
nucleic acid segment comprising a gene operatively linked to an SM22a promoter region; 
transferring that nucleic acid segment into a smooth muscle cell; and maintaining the smooth 
muscle cell under conditions effective to express the gene. In this method of the invention, 
the SM22cc promoter region preferably includes bases -441 to +41 of the SM22a gene (899- 
1382 of SEQ ID NO:l) or even bases -441 to +1 of the murine SM22a gene (899-1341 of 
SEQ ID NO:l) and may include up to 5,000 bases of the SM22a promoter. In the practice of 
this method, the heterologous gene is preferably a reporter gene, a cell cycle control 
regulatory gene, an angiogenesis gene, a virally encoded gene such as herpes simplex virus 
thymidine kinase (Chang, et. al., 1995b), an antisense molecule, or it may encode a muscle 
contraction inhibiting peptide, and may encode an Rb gene product or a peptide having the 
sequence MIRICRKK, SEQ ID NO: 19, or any gene or obvious variant of any gene as 
described above. The Rb gene may be the wild type Rb gene or it may be an altered gene 
such that the gene product is phosphorylation deficient. It is apparent from the present 
disclosure that it may not be necessary to collect the gene product in the practice of the 
present method. For example, if the gene product is a cell cycle regulatory element, or a 
contraction inhibiting peptide, then the cell itself will be the target of that effect and the utility 
of the method will not depend on collecting or even on identifying a protein product. 
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However, certain gene products will have utility as markers of gene expression and as useful 
proteins or peptides produced by a recombinant cell. 

In addition, the present invention may be described as a method of inhibiting smooth 
muscle cell proliferation comprising the steps of: obtaining an isolated nucleic acid segment 
comprising a cell cycle regulatory gene operatively linked to an SM22a promoter region; 
transferring the nucleic acid segment into a smooth muscle cell to obtain a transfected cell: 
and maintaining the smooth muscle cell under conditions effective to express the cell cycle 
regulatory gene; wherein expression of the cell cycle regulatory gene inhibits proliferation of 
the smooth muscle cell. In the practice of the method, the cell cycle regulatory gene 
operatively linked to an SM22a promoter region may comprise a viral vector, a plasmid 
vector or it may comprise an adenoviral vector. Further, the cell cycle regulatory gene mav 
preferably encode Rb, a phosphorylation deficient Rb gene, p53, p21, pl6, p27. a cell cycle 
dependent kinase inhibitor, E2F inhibitor, a CDK kinase or a cyclin gene, for example. 

The present invention may also be described in certain broad aspects as a method of 
preventing restenosis in a subject following balloon angioplasty of either a coronary artery, 
renal artery, peripheral artery or carotid artery, for example. In addition, the present invention 
may be described in certain broad embodiments as a method of preventing restenosis in a 
subject following balloon angioplasty of a vein as would be used in a coronary artery bypass 
surgery, or other bioprosthetic grafts that might be used in the periphery. This method 
comprises the steps of obtaining a viral vector comprising a cell cycle regulatory gene 
operatively linked to an SM22a promoter region dispersed in a pharmaceutically acceptable' 
solution and administering the solution to the subject. The subject may be an animal subject 
and is preferably a human subject. In the practice of the method, the viral vector is preferably 
a replication defective adenoviral vector and the gene may preferably encode herpes simplex 
thymidine kinase, p53, Rb, a phosphorylation deficient Rb gene, p53, p21, pi 6. p27, a cell 
cycle dependent kinase inhibitor, E2F inhibitor, a CDK kinase or a cyclin gene. 

An aspect of the invention is also a method of screening for identifying smooth 
muscle ceil specific transcriptional control elements and particularly those elements that work 
in tram. The method as provided herein preferably employs a reporter gene that confers on 
its recombinant hosts a readily detectable phenotype that is cither expressed or inhibited, as 
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the case may be. Generally reporter genes encode (a) a polypeptide not otherwise produced 
by the host cell; or (b) a protein or factor produced by the host cell but at lower levels; or (c) a 
mutant form of a polypeptide produced by the host cell. Preferably the gene encodes an 
enzyme which produces colorimetric or fluorometric change in the host cell which is 
detectable by in situ analysis and which is a quantitative or semi-quantitative function of 
transcriptional activation. Exemplary enzymes include esterases, phosphatases, proteases 
(tissue plasminogen activator or urokinase) and other enzymes capable of being detected by 
activity that generates a chrornophore or a fluorophore as will be known to those of skill in 
the art. 

Examples of such a reporter gene are the E. colt p-galactosidase (P-gal) and firefly 
luciferase genes. The p-gal enzyme produces a color change upon cleavage of the 
indigogenic substrate, indolyl-P-D-galactoside by cells expressing P-galactosidase. Thus, this 
enzyme facilitates automatic plate reader analysis of expression directly in microtiter wells 
containing transformants treated with candidate activators. Also, since the endogenous P> 
galactosidase activity in mammalian cells ordinarily is quite low, the analytic screening 
system using p-galactosidase is not hampered by host cell background. This enzyme offers 
the further advantage that expression can be monitored in vivo by tissue analysis as described 
below. 

Another class of reporter genes that confers detectable characteristics on a host cell 
are those that encode polypeptides, generally enzymes, that render their transformants 
resistant against toxins, e.g., the neo gene, which protects host cells against toxic levels of the 
antibiotic G418; a gene encoding dihydrofolate reductase, which confers resistance to 
methotrexate, or the chloramphenicol acetyl transferase (CAT) gene. Other genes for use in 
the screening assay herein are those capable of transforming hosts to express unique cell 
surface antigens, e.g. viral env proteins such as HIV gp!20 or herpes gD, which are readily 
detectable by immunoassays. 

In certain embodiments, the present invention may be described as a recombinant 
vector comprising an isolated SM22a promoter positioned adjacent a gene in a position to 
control expression of the gene. The splicing of nucleic acid sequences is well known in the 
art as described above and the insertion of such genes into vectors is also well known in the 
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art. The vector of the present invention may be a plasmid, a phagemid, a replication defective 
adenovirus, an adeno-associated virus or a retrovirus, for example. The type of vector does 
not necessarily in and of itself define the present invention, and therefore in certain 
embodiments, any vector that can transfer genetic material into a cell to be expressed in that 
cell will be useful in the present invention. It is also understood that the nucleic acid 
segments may be transferred into a cell by means such as liposomes, receptor ligand carriers, 
mechanical means such as electroporation, etc. and that all such embodiments are 
encompassed within the claimed invention. 

However, the recombinant vector of the present invention preferably is a replication 
deficient adenovirus or a high expression plasmid comprising an SM22a promoter 
operatively joined to a gene, and wherein the gene may be a cell cycle control gene, such as a 
retinoblastoma (Rb) gene, a phosphorylationdeficient Rb gene,p53, p21, pi 6, p27. a cell cycle 
dependent kinase inhibitor, E2F inhibitor, a CDK kinase or a cyclin gene; alternatively the gene 
may be an angiogenesis gene such as VEGF, iNOS, eNOS, basic FGF or FGF-5: or the gene 
may be a cytotoxic gene such as a herpes simplex thymidine kinase gene. 

It is understood that the method of inhibiting muscle contraction will have utility in 
the treatment of palliation of a variety of diseases that arise from muscle cell contraction. 
Such diseases include, but are not limited to Prinzmetal's angina, hypertension, Raynaud's 
phenomenon, migraine headache, a variety of collagen vascular diseases such as ELS, 
scleroderma, pulmonary hypertension, coronary arterial vasospasm, in contractile disorders of 
smooth muscle cells in the eye, gut, uterus, bladder, spleen, etc., or even in striated muscle 
spasms in paralysis victims. 

In a certain broad aspect the present invention may be described as a method of 
promoting angiogenesis in a subject comprising the steps of obtaining a nucleic acid segment 
comprising an angiogenesis factor gene operatively linked to an SM22a promoter region; and 
transferring the nucleic acid segment into a smooth muscle cell to obtain a transfected cell; 
wherein expression of the nucleic acid segment in the smooth muscle cell promotes 
angiogenesis. In the practice of the method, the smooth muscle cell may be a coronary 
arterial or venous smooth muscle cell, or it may be a peripheral arterial or venous smooth 
muscle ceil. A preferred angiogenesis factor is VEGF, iNOS, eNOS, basic FGF or FGF-5. In 
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certain embodiments of the method, the nucleic acid segment comprising an angiogenesis 
factor gene operatively linked to an SM22ot promoter region is contained in a viral or plasmid 
vector and the vector is administered to a subject. In certain alternate embodiments, the 
transferring is done ex vivo and the method further comprises the steps of seeding a 
5 bioprosthetic graft or stent with the transfected cells to obtain a seeded graft or stent; and 
placing the seeded graft or stent into a coronary or peripheral artery or vein of a subject. 

The present invention may also be described in certain broad aspects as a method of 
inhibiting smooth muscle proliferation comprising the steps of obtaining a nucleic acid 
segment comprising a cell cycle regulatory gene operatively linked to an SM22ct promoter 

10 region; transferring the nucleic acid segment into a primary smooth muscle cell in vivo or ex 
vivo to obtain a transfected cell; seeding a bioprosthetic graft or stent with the transfected cell 
to obtain a seeded graft or stent; and placing the seeded graft or stent into a coronary or 
peripheral artery or vein of a subject, wherein expression of the ceil cycle regulatory gene 
inhibits proliferation of a smooth muscle cell. 

15 The GenBank accession number for the murine SM22a cDNA is L41154. The 

GenBank accession number for the murine SM22a gene is L41 161 . 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 FIG. 1 Cellular-specificity of the 441-bp SM22ot promoter. The p-441 SM221uc 

(black bar) and pRSVL (hatched bar) plasmids were transiently transfected into primary rat 
aortic SMCs (VSMC) ? A7r5, NIH 3T3 (3T3), COS-7, and Hep G2 cells and the normalized 
luciferase activities for each respective plasmid was determined. Data are expressed as 
normalized luciferase light units + S.E.M. 

25 FIG. 2 Schematic representation of the c/.v-acting elements and the /ram-acting 

factors identified by DNase I footprint and EMSAs analyses that bind to the SM22a 
promoter. Six nuclear protein binding sites were identified by DNase I footprint analysis in 
the 441-bp arterial SMC-specific SM22a promoter that were designated SME-1-6, 
respectively. 7/Y7/?.v-acting factors identified by EMSA that bind to each nuclear protein 

30 binding site are shown above or below each c/.s-acting element. Binding sites for SRF and 
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ternary complex factors (T) (SME-1 and SME-4), Spl (SME-1, -2 ? -5, -6), YY1 (SME-3, -4, - 
6), CREB-1 (SME-6) and ATF-1 (SME-6) were identified. Nuclear protein complexes that 
could not be positively identified by antibody supershift reactions are shown in gray below 
the nuclear protein binding site to which they bind. 

FIG. 3 A. Schematic representations of the AdSM22-lacZ and AdCMV-lacZ 
adenoviral vectors. The AdSM22-lacZ vector (upper panel) encodes the bacterial lacZ 
reporter gene (black box) and bovine growth hormone polyadenylation signal (box with 
horizontal lines) under the transcriptional control of the 441 -bp murine SM22ct promoter 
(box with diagonal lines) and the 450-bp human 4F2 transcriptional enhancer (white box). 
The El and E3 regions of the Ad5Sub360 adenoviral genome were deleted (AF3) rendering 
the virus replication-defective. The AdCMV-lacZ control virus encodes the lacZ reporter 
gene (black box) under the transcriptional control of the cytomegalovirus immediate early 
gene promoter enhancer (box filled with dots). 

FIG. 3B. Comparison of the activity of AdSM22-lacZ and the AdCMV-lacZ control 
virus in primary cultures of rat aortic smooth muscle cells (VMSC) and human umbilical vein 
endothelial cells (HUVEC). Primary cultures of VSMCs or HUVECs were infected with 1-, 
10-, and 100-PFU of either AdSM22-lacZ (black squares-HUVEC and black circles- VSMC) 
or AdCMV-lacZ (open squares -HUVEC and open circles-VSMC) and the % of cells 
expressing p-galactosidase activity was quantitated 72-h post-infection. Data are expressed 
as the mean percentage of cells expressing pgal activity ± S.E.M. 

DETAILED DESCRIPTION 
OF THE PREFERRED EMBODIMENTS 

The present invention arises from the isolation and characterization of a smooth 
muscle cell specific promoter region that, in its naturally occurring state, controls the 
expression of the SM22a gene, and the discovery that this isolated promoter region may be 
operatively joined to a heterologous structural gene and will control the expression of that 
gene specifically in smooth muscle cells and other myogenic cell lineages including an 
embryonic skeletal muscle cell. An important element of the present invention is that, unlike 
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other known smooth muscle cell promoters, the SM22a promoter is cell cycle independent 
and is thus not down-regulated when the cell enters the proliferative state. The promoter 
sequence of the present invention will be useful in the expression of heterologous genes in a 
smooth muscle celL in the discovery of trans and cis acting transcriptional control elements 
that affect smooth muscle cell gene expression and as a probe to isolate SM22oc genes and 
promoters. In particular, the present invention will find use in the prevention of restenosis 
following balloon angioplasty or other arterial injury, in the treatment of artherosclerosis or 
peripheral vascular occlusive disease, in the promotion of angiogenesis in graft or stent 
implants and in the treatment or prevention of asthma, among other smooth muscle cell 
proliferative diseases. 

As an embodiment of the present invention, a recombinant replication-defective 
adenoviral vector (RDAd) was generated, designated AdSM22-lacZ, which contains the 
bacterial lacZ reporter gene under the transcriptional control of the SMC-specific SM22a 
promoter (Solway et aL, J. Biol. Chern., 270 (22): 13460-13469, 1995; Kim et aL Mol Cell 
Biol, (17):2266-2278, 1997; Li et al, J. Cell Biol, 132:849-859, 1996b). As shown herein, 
this adenoviral construct, AdSM22-lacZ, programs SMC -specific expression of the lacZ 
reporter gene in cultured cells. In addition, the SMC-specificity of the AdSM22-lacZ virus is 
maintained in vivo following intra-arterial, inlra-muscular and intravenous administration. 
Finally, AdSM22-lacZ programs transgene expression in visceral, as well as vascular SMCs 
in vivo. 

It is an important discovery, as disclosed herein, that the expression of a recombinant 
gene product encoded by a RDAd can be regulated in a cell-lineage restricted fashion by a 
transcriptional regulatory element in normal cells in vivo. The present inventors have 
previously generated adenoviral vectors containing other cell lineage-specific transcriptional 
regulator}' elements and observed that the majority of these elements lose their cell lineage- 
specificity when tested in vivo in the context of an adenoviral vector. Similarly. Arbuthnot et. 
al. reported that RDAd containing transgenes under the transcriptional control of the alpha- 
fetoprotein (AFP) promoter are capable of mediating cell lineage-restricted gene expression 
in hepatoma cells, but not in normal liver parenchyma (Arbuthnot et al. Human Gene 
Therapy, 7 (1 3): 1 503- 1 5 1 4, 1996). Without restricting the present invention to any particular 
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theory, it is possible that the murine SM22a promoter may contain an insulator element or a 
locus control region that is capable of protecting it from the activity of cryptic transcriptional 
regulatory elements located within the adenoviral genome. In this regard, the murine SM22oc 
promoter is as active as the most potent viral LTRs in SMCs (Solway et al 9 1995), and 
functions in a SMC-specific fashion both in vitro and in transgenic mice in vivo (Kim et al. 9 
1997; Li et al, 1996b; Moessler et aL, Development, 122:2415-2425, 1996). It is 
contemplated, therefore, that an insulator element, when isolated from an SM22ct promoter 
will be useful as a means of conferring tissue specificity to those tissue specific promoters 
that lose tissue specificity when expressed from an adenoviral vector. 

Furthermore, in transgenic mice the SM22oc promoter is active in arterial, but not 
visceral SMCs (Li et al., Cira Res., 78:188-195, 1996a; Kim et al, 1997). Therefore, the 
demonstration described herein that the lacZ reporter gene encoded by AdSM22-lacZ was 
expressed in visceral, as well as, vascular SMCs was somewhat surprising. Again, without 
relying on any particular theory, this difference may reflect the fact that in adenovirus- 
infected cells DNA remains episomal, whereas, in transgenic mice it is integrated into the 
host genome where its transcriptional activity can be-modulated by alterations in chromatin 
structure. 

RDAd are particularly useful tools for studying the molecular pathogenesis of 
atherosclerosis and other vascular proliferative disorders. Adenoviruses can be delivered in 
spatially- and temporally-restricted fashions to the vessel wall in both normal and 
atherosclerotic vessels in large and small animals (French et aL, 1994; Simari et aL, 1996). 
However, previous studies using these vectors to investigate the pathogenesis of vascular 
proliferative disorders have not been able to distinguish effects due to alterations in vascular 
SMC gene expression from those resulting from transgene expression in endothelial or 
adventitial cells. In this respect, RD Ad-containing transgenes under the control of the SM22a 
promoter allow one to determine directly the effects of SMC-specific transgene expression on 
vascular SMC proliferation and neointima formation. In addition, because SM22a containing 
RDAd program transgene expression in visceral SMCs. these viruses may also be useful to 
examine the pathogenesis and treatment of diseases mediated by visceral SMCs. Examples of 



WO 98/15575 



PCT/US97/16204 



- 17- 

such diseases include asthma, achalasia, leiomyosarcomas, irritable bowel syndrome and 
uterine leiomyomas. 

Although the efficacy of RDAd have been established in both large and small animal 
models of vascular proliferative disease (Ohno et aL, 1994; Chang et aL, 1995a; Guzman et 
aL, 1994: Yang et al. Proa Natl Acad. Set USA, 93 (1 5):7905-791 0, 1996; Chang et al., 
1995b), safety concerns persist due to the capacity of these viruses to infect most cells and 
tissues. In this regard, SM22a promoter-driven adenoviruses may prove advantageous as 
vehicles to deliver therapeutic genes to the vessel wall for the treatment of vascular 
proliferative disorders. The lack of cytotoxic or cytostatic transgene expression in the 
endothelial cells at the margins of the arterial injury should, in theory, promote more rapid re- 
endothelialization of the injured vessel by facilitating the proliferation and migration of 
adjacent endothelial cells (Ohno et ai, 1994). Of equal importance, the potential for systemic 
toxicity resulting from ectopic expression of potentially toxic adenovirus-encoded transgenes 
in other tissues and organs should be reduced by use of the SMC-specific SM22ct promoter. 
Finally, recent reports have demonstrated that much of the immunogenicity of adenovirus 
infected cells is due to cellular and humoral immune responses directed against foreign 
transgene proteins (Kashyap et a!., 1995). By restricting ectopic expression of adenovirus- 
encoded transgenes in non-SMC containing tissues, AdSM22 viruses may also reduce the 
immune-mediated damage to organs such as the liver and the lung following intentional or 
inadvertent systemic administration of the vector, as indicated by the finding of decreased 
hepatic toxicity following IV administration of high dose AdSM22-lacZ. 

In one aspect, the present invention provides a process of directing and regulating 
gene expression in a smooth muscle cell. In accordance with that process, a gene operatively 
joined to an SM22a promoter is delivered to a smooth muscle cell and the smooth muscle 
cell is then maintained under physiological conditions and for a period of time sufficient for 
the gene to enter the smooth muscle cell, for the gene to be transcribed and in certain 
embodiments, for the product of that gene to be expressed. Delivery is preferably by 
transfection with a plasmid or a high expression plasmid, replication defective adenovirus, 
adeno-associated virus, p21 virus, raus sarcoma virus, or other virus vector construct capable 
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of transfecting a smooth muscle cell, and comprising an SM22a promoter operative!)/ joined 
to a coding sequence that encodes the gene product. 

The use of adenovirus as a vector for cell transfection is well known in the art. 
Adenovirus vector-mediated cell transfection has been reported for various cells (Stratford- 
Perricaudet, et al, J. Clin. Invest., 90, 626-630, 1992). An adenovirus vector of the present 
invention is replication defective. A virus is rendered replication defective by deletion of the 
viral early region 1 (El) region. An adenovirus lacking an El region is competent to replicate 
only in cells, such as human 293 cells, which express adenovirus early region 1 genes from 
their cellular genome. Thus, such an adenovirus cannot replicate in cells that do not provide 
the early gene product of the El region. In a preferred embodiment, an adenovirus vector 
used in the present invention is lacking both the El and the E3 early gene regions. Thus, it is 
most convenient to introduce the coding sequence for a gene product at the position from 
which the El and/or E3 coding sequences have been removed (Karlsson et al^ EMBO J., 5. 
2377-2385. 1986). Preferably, the El region of adenovirus is replaced by the coding DNA 
sequence or gene. However, the position of insertion within the adenovirus sequences is not 
critical to the present invention. Techniques for preparing such replication defective 
adenoviruses are well known in the art as exemplified by Ghosh-Choudhury et al. y 1987: 
McGrory et aL, 1988; and Gluzman et al, 1982. 

A wide variety of adenovirus vectors can be used in the practice of the present 
invention. An adenovirus vector can be of any of the 42 different known serotypes of 
subgroups A-F. Adenovirus type 5 of subgroup C is the preferred starting material for 
production of a replication-defective adenovirus vector. Adenovirus type 5 is a human 
adenovirus about which a great deal of biochemical and genetic information is known, and it 
has historically been used for most constructions employing adenovirus as a vector. 

In order to replicate the virus., the vector is co-transfected into 293 cells together with 
a plasmid carrying the complete adenovirus type 5 genome. Preferred plasmids may also 
confer ampicillin and tetracycline resistance due to insertion of the appropriate sequences into 
the virus genome. The molecular strategy employed to produce recombinant adenovirus is 
based upon the fact that, due to the packaging limit of adenovirus, the piasmid cannot 
efficiently form plaques on its own. Therefore, homologous recombination between the 



WO 98/15575 



- 19- 



PCT/US97/16204 



desired construct and the co-transfected plasmid within a transfected cell results in a viable 
virus that can be packaged and form plaques only on 293 cells. 

Co-transfection is performed in accordance with standard procedures well known in 
the art. By way of example, 293 cells are cultured in Dulbecco's modified Eagle's medium 
containing 10% fetal calf serum in a humidified 5% C0 2 atmosphere. Confluent 10 cm 
dishes are split into three 6 cm dishes. On the following day, the cells are cotransfected in 
calcium phosphate with HeLa DNA as carrier. Six hours after addition of the DNA to the 
cells, a 15% glycerol stock is used to boost transfection efficiency and the cells are overlaid 
with 0.65% Noble agar in DMEM containing 2% FCS, 50 mg/ml penicillin G, 10 mg/ml 
streptomycin sulfate, and 0.25 mg/ml fungizone (GIBCO, Grand Island, NY). Monolayers 
are incubated for approximately 10 days until the appearance of viral plaques. 

These plaques are picked, suspended in DMEM containing 2% FCS, and used to 
infect a new monolayer of 293 cells. When greater than 90% of the cells show infection, viral 
lysates are subjected to a freeze/thaw cycle and designated as primary stocks. Recombinant 
virus with the correct structure is verified by preparation of viral DNA from 
productively-infected 293 cells, restriction analysis, and Southern blotting. Secondary stocks 
are subsequently generated by infecting 293 cells with primary virus stock at a multiplicity of 
infection of 0.01 and incubation until lysis. 

The particular cell line used to propagate the recombinant adenoviruses of the present 
invention is not critical to the present invention. Recombinant adenovirus vectors can be 
propagated on, e.g., human 293 cells, or in other cell lines that are permissive for conditional 
replication-defective adenovirus infection, e.g., those which express adenovirus El gene 
products "in trans 11 so as to complement the defect in a conditional replication-defective 
vector. Further, the cells can be propagated either on plastic dishes or in suspension culture, 
in order to obtain virus stocks thereof. 

When the vector is to be delivered to an animal subject, a preferred method is to 
percutaneously infuse an adenovirus vector construct into a native or balloon-injured blood 
vessel that perfuses smooth muscle cells (WO 9411506; Barr et aL, Gene Therapy, 1 (1):51- 
58, 3994; both incorporated herein by reference) by intravenous or intra-arterial injection. 
Methods of delivery of foreign DNA are known in the art. such as containing the DNA in a 
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liposome and infusing the preparation into an artery (LeCierc et ai. Circulation 85:543, 1 992. 
incorporated herein by reference), transthoracic injection (Gal et al.. Lab Invest. 68:18-25. 
1993. incorporated herein by reference). Other methods of delivery may include coating a 
balloon catheter with polymers impregnated with the foreign DNA and inflating the balloon 
5 in the region of arteriosclerosis, thus combining balloon angioplasty and gene therapy (Nabel 
et al. Cardiovascular Research, 28:445-455, 1994, incorporated herein by reference). 

After delivery of an adenovirus vector construct to a smooth muscle cell, that cell is 
maintained under physiological conditions and for a period of time sufficient for the 
adenovirus vector construct to infect the cardiac cell and for cellular expression of a coding 
10 sequence contained in that construct. Physiological conditions are those necessary for 
viability of the smooth muscle cell and include conditions of temperature, pH, osmolality and 
the like. In a preferred embodiment, temperature is from about 20°C to about 50°C, more 
preferably from about 30°C to about 40°C and, even more preferably about 37°C pH is 
preferably from a value of about 6.0 to a value of about 8.0, more preferably from about a 
15 value of about 6.8 to a value of about 7.8 and, most preferably about 7.4. Osmolality is 
preferably from about 200 milliosmols per liter (mosm/L) to about 400 mosm/1 and, more 
preferably from about' 290 mosm/L to about 310 mosm/L. Other physiological conditions 
needed to sustain smooth muscle cell viability are well known in the art. 

It should also be pointed out that because the adenovirus vector employed is 

20 replication defective, it is not capable of replicating in the cells that are ultimately infected. 
Moreover, it has been found that the genomic integration frequency of adenovirus is usually 
fairly low, typically on the order of about 1%. Thus, where continued treatment is required, it 
may be necessary to reintroduce the virus every 6 months to a year. In these circumstances, it 
may therefore be necessary to conduct long term therapy, where expression levels are 

25 monitored at selected intervals. 

An adenovirus vector construct is typically delivered in the form of a pharmacological 
composition that comprises a physiologically acceptable carrier and the adenovirus vector. 
An effective expression-inducing amount of such a composition is delivered. As used herein, 
the term "effective expression-inducing amount" means that number of virus vector particles 

30 necessary to effectuate expression of a gene product encoded by a coding sequence contained 



WO 98/15575 



-21 - 



PCT/US97/16204 



in that vector. Means for determining an effective expression-inducing amount of an 
adenovirus vector construct are well known in the art. An effective expression-inducing 
amount is typically from about 10 7 plaque forming units (pfu) to about 10 15 pfu, preferably 
from about 10 8 pfu to about 10 14 pfu and, more preferably, from about 10 9 to about 10 12 pfu. 

As is well knownvin the art, a specific dose level for any particular subject depends 
upon a variety of factors including the infectivity of the adenovirus vector, the age, body 
weight, general health, sex, diet, time of administration, route of administration, rate of 
excretion, and the severity of the particular disease undergoing therapy. 

In that adenovirus is a virus that infects humans, there may be certain individuals that 
have developed antibodies to certain adenovirus proteins. In these circumstances, it is 
possible that such individuals might develop an immunological reaction to the virus. Thus, 
where an immunological reaction is believed to be a possibility, one may desire to first test 
the subject to determine the existence of antibodies. Such a test could be performed in a 
variety of accepted manners, for example, through a simple skin test or through a test of the 
circulating blood levels of adenovirus-neutralizing antibodies. In fact, under such 
circumstances, one may desire to introduce a test dose of on the order of 1 x 10 5 to 1 x 10 6 or 
so virus particles. Then, if no untoward reaction is seen, the dose may be elevated over a 
period of time until the desired dosage is reached, such as through the administration of 
incremental dosages of approximately an order of magnitude. 

In another aspect, the present invention relates to pharmaceutical compositions that 
may comprise an adenovirus vector gene construct dispersed in a physiologically acceptable 
solution or buffer. A composition of the present invention is typically administered 
parenterally in dosage unit formulations containing standard, well known, nontoxic, 
physiologically acceptable carriers, adjuvants, and vehicles as desired. The term parenteral as 
used herein includes intravenous, intramuscular, intraarterial injection, or infusion techniques. 

Injectable preparations, for example, sterile injectable aqueous or oleaginous 
suspensions are formulated according to the known art using suitable dispersing or wetting 
agents and suspending agents. The sterile injectable preparation can also be a sterile 
injectable solution or suspension in a nontoxic parenterally acceptable diluent or solvent, for 
example, as a solution in L3-butanediol. Among the acceptable vehicles and solvents that 
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may be employed are water, Ringer's solution, and isotonic sodium chloride solution. In 
addition, sterile, fixed oils are conventionally employed as a solvent or suspending medium. 
For this purpose any bland fixed oil can be employed including synthetic mono- or di- 
glycerides. In addition, fatty acids such as oleic acid find use in the preparation of 
injectables. 

Preferred carriers include neutral saline solutions buffered with phosphate, lactate, 
Tris, and the like. Of course, one purifies the vector sufficiently to render it essentially free 
of undesirable contaminant, such as defective interfering adenovirus particles or endotoxins 
and other pyrogens such that it will not cause any untoward reactions in the individual 
receiving the vector construct. A preferred means of purifying the vector involves the use of 
buoyant density gradients, such as cesium chloride gradient centrifugation. 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in 
the examples which follow represent techniques discovered by the inventor to function well 
in the practice of the invention, and thus can be considered to constitute preferred modes for 
its practice. However, those of skill in the art should, in light of the present disclosure, 
appreciate that many changes can be made in the specific embodiments which are disclosed 
and still obtain a like or similar result without departing from the spirit and scope of the 
invention. The following techniques and materials were used in the practice of the examples 
unless otherwise indicated. 

Isolation of Murine SM22a cDNA Clones 

The coding region of the murine SM22a cDNA was isolated by performing low 
stringency PCR™ using murine uterine RNA and synthetic 5' and 3* oligonucleotide PCR™ 
primers constructed from the previously published sequence of the rat SM22a cDNA 
(Nishida et al., Gene, 130:297-302, 1993). The 5' PCR™ primer was constructed to be 
identical lo the first 34-bp of the rat SM22cc cDNA with the addition of a 5' EcoRI site (5' 
ATCGAATTCCGCTACTCTCCTTCCAGCCC ACAAACGACCAAGC 3', SEQ ID NO: 10). 

The 3' primer was constructed to include the reverse complement of bp 759 to 782 of the ral 
SM22a cDNA with an additional 3' Hindlll restriction site (5' 
ATCAAGCTTGGTGGGAGCTGCCCATGTGCAGTC 3\ SEQ ID NO:l 1). PCR™ reaction 
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products were subcloned into EcoRIl 7//W///-digested pGEM7Z (Promega, Madison, WI) as 
described elsewhere (Parmacek and Leiden, J. Biol. Chem.:264: 1321 7-13225, 1989). The 
nucleotide sequence of the murine SM22ct cDNA was confirmed by sequencing of the full- 
length murine SM22a genomic clone. MacVector DNA sequencing software (Kodak/IBL 
Rochester, NY) was used for DNA sequence analyses. 

To isolate the 3* untranslated region of the SM22oc cDNA, 5 x 10 5 recombinant clones 
from an oligo-(dT) primed kgtll C2C12 myotube cDNA library were screened with the 
[ 32 P]-Iabeled murine SM22a cDNA probe (bp 29-81 1) as described previously (Parmacek et 
aL 9 Mol Cell Biol, 12:1967-1976, 1992). Twelve clones were purified to homogeneity and 
analyzed by Southern blot analyses as described (Parmacek et al, 1992). Two independent 
clones, each of which contained a poly(A) tail, were subcloned into £co/?/-digested pGEM7Z 
and their nucleotide sequences determined. The nucleotide sequence of the 5' -untranslated 
region was determined from the sequence of the SM22a genomic clone. The 5' -untranslated 
region was localized on the genomic clone by Southern blot hybridizations, in addition to 
RNase protection and primer extension analyses as described below. 

Isolation of Murine SM22ot Genomic Clones 

Approximately 1 x 10 6 recombinant phage from a murine 129SV Lambda FIX II 
genomic library (Stratagene, La Jolla, CA) were screened with the 783-bp murine SM22a 
cDNA probe (bp 29-811) labeled with |a- 32 P]dCTP, and three positive clones were purified 
to homogeneity as described previously (Parmacek et al. 9 1992). One clone (SM22-13a) was 
found to include the entire coding region of the SM22ct gene and 9-kb of 5' flanking sequence 
and was used for all subsequent subcloning and sequencing studies. 

Southern Blot Analyses 

High molecular weight DNA was prepared from the tails of strain 129SV mice as 
described previously (Parmacek et ai, 1989). Southern blotting and hybridization to the 
radiolabeled 783-bp murine SM22ot cDNA probe were performed as described previously 
(Parmacek et al, 1989). ). Low stringency washing conditions were 2X SSC, 0.1% SDS at 
50°C High stringency washing conditions were 0.1X SSC, 0.1% SDS at 68°C. 
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Northern Blot Analyses 

Tissues were isolated . from 12-week old 129SV mice (Jackson Laboratories) as 
described previously (Parmacek et al, 1989). Animals were housed and cared for according 
to NIH guidelines in the University of Chicago Laboratory Animal Medicine Veterinary 
Facility. RNA was prepared from organ samples and from cultures of primary rat aortic 
SMCs, the rat SMC line A7r5, and non-smooth muscle cell lines including murine NIH 3T3 
cells, murine C3H10T1/2 cells, monkey COS-7 cells, murine C2C12 myoblasts and 
myotubes, human HepG2 cells, and murine EL-4 cells by the single step guanidinium 
isothiocyanate protocol (Chomczynski, Biotechniques, 15:532-537, 1993). Northern blotting 
was performed using 10 mg of RNA per sample as described previously with the exception 
that 36 mg/ml of ethidium bromide was added to the RNA resuspension buffer in order to 
permit quantitation of the 28S and 18S ribosomal RNA subunits in each lane. Probes 
included the 783-bp (bp 29-81 1) murine SM22a cDNA and the 754-bp (bp 659-1404) murine 
calponin cDNA probe. Quantitative image analyses were performed using a Molecular 
Dynamics Phosphorlmager (Sunnyvale, CA). 

Primer Extension, 5' RACE, and RNase Protection Analyses 

A 25-mer oligonucleotide probe constructed to include the reverse complement of 
base pairs +80 to +104 of the SM22oc gene (5' TGCCGTAGGATGGACCCTTGTTGGC 3', 
SEQ ID NO: 12) was 5* end labeled with [y- 32 P]ATP and T4 polynucleotide kinase. 40 mg of 
mouse uterine RNA was hybridized to 2 x 1 0 6 DPM of labeled probe and primer extension 
reactions performed at 42°C, 50°C and 56°C as described previously (Parmacek et aL, 1992). 

5' RACE was performed using murine uterine RNA and a synthetic antisense cDNA probe 
corresponding to bp 234 to 258 of the murine SM22ct cDNA according to the manufacturer's 
instructions (Perkin Elmer, Norwalk, CT). RNase protection analyses were performed by 
subcloning the -441 to +41 murine SM22a genomic subfragment including a synthetic 3' 
HindW linker into Pstl/Hirtdlll-digested pGEM4Z and performing in vitro transcription of 
the antisense strand of the genomic subfragment with T7 polymerase of the TV'coMinearized 
plasmid (Ncol cuts at bp -88 of the genomic clone) in order to obtain an antisense cRNA 
probe corresponding to bp -88 to +44. The HindW linker shares sequence identity with the 
SM22a cDNA resulting in a cRNA probe with sequence identity initiated at bp +44 (not +41) 
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of the SM22a genomic clone. The 142-bp probe was labeled with [a- 32 P]UTP and RNase 
Protection Analyses were performed using the RPAII™ kit (Ambion, Austin, TX) according 
to the manufacturer's instructions. Antisense cRNA probe radiolabeled by incorporation of 
a-[ 32 P]-UTP is synthesized by in vitro transcription from linearized pBluescriptIIKST7-/tfcZ, 
which contains the lacZ gene upstream of the T7 RNA polymerase promoter, using the 
MaxiScript™ kit (Ambion, Austin, TX). Band intensity is quantified by Phosphorlmager™, 
as previously for southern analyses described above. 
Cell Culture 

The rat cell line A7r5 which was derived from embryonic thoracic aorta was grown in 
Dulbecco's Modified Essential Media (GIBCO) supplemented with 10% fetal bovine serum 
(GIBCO) and 1% penicillin/streptomycin. The human hepatocellular carcinoma cell line Hep 
G2 was grown in Modified Eagle's Medium supplemented with 10% fetal bovine serum and 
0.1 mM MEM non-essential amino acids (GIBCO). Murine lymphoma-derived EL4 cells 
were grown in Dulbecco's modified Eagle's Media supplemented with 10% horse serum 
(GIBCO). Murine NIH 3T3 cells, C3H10T1/2 cells, C2C12 myoblasts and myotubes were 
grown as described previously (Parmacek et al. y J. Biol. Chem., 265:15970-15976, 1990; 
Parmacek et al., Mol Cell Biol, 14:1870-1885, 1994). Primary cultures of rat aortic SMCs 
were isolated from 12-16 week old Sprague Dawley rats (Charles River Laboratories, 
Wilmington, MA) using the method described previously (Chang et al, Science, 1995a). 
Virtually all cells isolated using this method stain positive with anti-smooth muscle actin 
monoclonal antiserum. In all studies, only early passage (passage 2 or 3) rat aortic SMCs 
were utilized. For the cell cycle analyses, SMCs from the third passage were placed in 
serum-free medium (50% Dulbecco's minimal essential medium (DMEM), 50% Ham's F-12, 
L-glutamine (292 mg/ml), insulin (5 mg/ml), transferrin (5 mg/ml), selenious acid (5 ng/ml)) 
for 72 hrs in order to synchronize the cells in G 0 /G, as described previously (Chang et al, 
1995a). Following 72 hrs of serum starvation, cells were stimulated to proliferate by 
incubation in medium containing 45% DMEM, 45% Ham's F-12 and 10% FBS. Mouse 
WEHI B-cells and mouse 70Z/3 pre-B lymphocytes were grown as described previously 
(Morrisey et al. Dev. Biol, 177 p :309-322, 1996). 
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DNase I footprinting 

Nuclear extracts were prepared from the smooth muscle cell line, A7r5 (which express 
high levels of SM22a mRNA (Solway el ai, 1995)) and NIH 3T3 cells as described 
previously (Parmacek et ai, 1992). Three overlapping genomic subfragments (bp -441 to - 
256, bp -256 to -89, and bp -89 to +41) spanning the 482-bp SM22a promoter were analyzed. 
DNase I footprint analyses were performed with 100-150 mg of nuclear extracts prepared 
from the smooth muscle cell line, A7r5, or NIH 3T3 fibroblasts and the end-labeled sense and 
antisense strands of the murine SM22a promoter as described previously (Parmacek et ai. 
1994). Standard Maxam and Gilbert (G + A) sequencing reactions were run in parallel to 
identify the protected sequences. 
Electrophoretic mobility shift assays (EMSAs) 

Nuclear extracts were prepared from low passage number primary rat aortic SMCs. 
A7r5 cells, NIH 3T3 cells, C3H10T1/2 cells, C2C12 myotubes, WEHI, 70Z/3 and EL4 cells 
as described by Dignam et ai Nucleic Acids Res. 11: 1475, (1983). EMSAs were performed 
in 0.25X TBE (IX TBE is 100 mM Tris, 100 mM boric acid and 2 mM EDTA) as described 
previously (Ip et ai. Moi Cell. Bioi, 14:7517-7526, 1994). The following complementary 
oligonucleotides (corresponding to each nuclear protein binding site identified by DNasel 
footprint analysis or nuclear protein binding sites containing the specific mutations indicated 
(mutated nucleotides are underlined)) were synthesized with BamHI and BgUI overhanging 
ends: 

SME-1-5' AAGGAAGGGT TTCAGGGTCC TGCCCATAAA AGGTTTTTCC CGGCCGC 
3' (SEQ ID NO:21); 

j.iSME-1- 5'AAGGAAGGGT TTCAGGGTCC TGCCCATAGA TCTTTTTTCC CGGCCGC 
3' (SEQ ID NO:22); 

SME-2- 5' CCGCCCTCAG CACCGCCCCG CCCCGAGGCC CGCAGCATGT CCG 3' 
(SEQ ID NO:23); 

HSME-2- 5' CCGCCCTCAG CACCGCGGAT CCCCGACCCC CGCAGCATCT CCG 3' 
(SEQ ID NO:24): 
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SME-3- 5' CTCCAAAGCA TGCAGAGAAT GTCTCCGGCT GCCCCCG 3' (SEQ ID 
NO:25); 

uSME-3- 5' CTCGGAICCA TGC TAGC AAT GAATTCGGCT GCCCCCG 3' (SEQ ID 
NO:26); 

SME-4- 5' TCCAACTTGG TGTCTTTCCC CAAATATGGA GCCTGTGTGG AGTG 3' 
(SEQ ID NO:27); 

p.SME-4- 5' TCCAACTTGG TGTCTTTCCC CAAGGATCCA GCCTGTGTGG AGTG 
3'(SEQ ID NO:28); 

p.SRF/SME-4- 5' TCCAACTTGG TGTCTTTCCC CGGATATGGA GCCTGTGTGG 
AGTG 3'(SEQ ID NO:29); 

p.YYl/SME-4- 5' TCCAACTTGG TGTCTTTCCC CAAATTAGGA GCCTGTGTGG 
AGTG 3' (SEQ ID NO:30); 

SME-5- 5' GGGCAGGGAG GGGCGCCAGC G 3' (SEQ IDNO:31); 

pSME-5- 5' GGGCAGGTAC CG AATT CAGC G 3' (SEQ ID NO:32); 

SME-6- 5' GGACGGCAGA GGGGTGACAT CACTGCCTAG GCGGCCG 3' (SEQ ID 

NO:33); 

HCREB/SME-6- 5' GGACGGCAGA GGGG ATC CAT GCCTGCCTAG GCGGCCG 3' 
(SEQ ID NO:34); 

uYYl/SME-6- 5' GGACGGCAGA GGGG ATC CAT CACTGCCTAG GCGGCCG 3' (SEQ 
IDNO:35); 

Spl- 5' CTGGCTAAAG GGGCGGGGCT TGGCCAGCC 3' (SEQ ID NO:36); 
CREB/TCRa- 5' CTCCCATTTC CATGACGTCA TGGTTA 3' (SEQ ID NO:37). 

For cold competition studies, 5 to 100 ng of unlabeled competitor oligonucleotide was 
included in the binding reaction mixture. For antibody supershift studies, 1 pi of either rabbit 
preimmune, affinity purified rabbit or mouse IgG (Santa Cruz), a-SRF rabbit polyclonal 
antiserum (Santa Cruz. sc-335X), a-Spl rabbit polyclonal IgG (Santa Cruz, sc-059X). a-YYl 
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rabbit polyclonal IgG (Santa Cruz, sc-281X), ot-CREB-1 mouse monoclonal IgG2 (Santa 
Cruz, sc-271), a-ATF-1 mouse monoclonal IgA (Santa Cruz, sc-243), ct-AP2 rabbit 
polyclonal IgG (Santa Cruz, SC-184X), or a-GATA-4 rabbit polyclonal IgG (Ip et al, 1994) 
was incubated with the indicated nuclear extract at 4°C for 20 minutes prior to the binding 
5 reaction as described previously (Ip et al., 1994). 

Plasmids 

To assess the function of each of the six nuclear protein binding sites identified within 
the SM22a promoter, a series of SM22a mutant promoter-luciferase reporter plasmids were 
generated by PCR™-mediated site directed mutagenesis as described previously (Morrisey et 
10 al , 1 996). The rous sarcoma virus (RS V) LTR-dri ven luciferase reporter plasmid, pRS VL, and 
the pMS Vpgal reference plasmid have been described previously (Parmacek et al. , 1 992). The 
promoterlesspGL2-Basicplasmid (Promega, Madison, WI) served as the cloning backbone for 
all of the luciferase reporter plasmids described herein. The p-5000/1 1 SM221uc plasmid, 
contains 5-kb of SM22a 5* flanking sequence, the untranslated SM22a first exon, the SM22a 

15 first intron and the first 12-bp of exon 2 of the SM22a gene subcloned 5' of the luciferase 
reporter gene. It was constructed by first subcloning the 8.5 kb BamHIIHindlll SM22a 
genomic subfragment (containing 5-kb of 5* flanking sequence, exon 1 and 3. 5-kb of intron 1) 
into Bgl III Hindi II digested pGL2-Basic vector. Next, a 488-bp PCR™-generated Hindlll- 
linkered SM22oc genomic subfragment, including at its 5' end the SM22ct intron 1 HindHI 

20 restriction site, and running to bp +76 of the SM22 cDNA (which includes 12-bp of exon 2) was 
subcloned into the ///^///-digested vector and its correct orientation (5' to 3' relative to the 
luciferase reporter gene) confirmed by DNA sequence analysis. The p-5000SM221uc plasmid, 
containing 5-kb of SM22a 5* flanking sequence subcloned 5' of the luciferase reporter gene, 
was constructed by first subcloning the 2.2-kb BamHI/EcoRI SM22a genomic subfragment 

25 (corresponding to bp -5000 to -2800) into BamHI/EcoRI-dlgesled pBluescript IIKS (Stratagene 
La Jolla, CA). Next, the 1250-bp EcoRIINcol SM22a genomic subfragment corresponding to 
bp -1338 to -89 and the 130-bp PCR™-generated genomic subfragment containing bp -88 
(including the Ncol site at its 5' end) to +41 (including a HindHI linker at its 3' end) was ligated 
into the EcoRI/HindlH-digQsied vector. Then, the 1 .4-kb EcoRI SM22a genomic subfragment 
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(corresponding to bp -2800 to -1339) was subcioned into the EcoRI-digesled plasmid and its 
orientation confirmed by DNA sequence analysis. Finally, the resulting SM22ct genomic 
subfragment corresponding to bp -5 kb to +41 was excised from the Bluescript phagemid with 
BamHI and HindlU and subcioned into Bglll/Hindlll-digested pGL2-Basic. The p- 
1338SM221uc plasmid containing the 1379-bp SM22a genomic subfragment (bp -1338 to +41) 
subcioned 5' of the luciferase reporter in the pGL2-Basic vector, was constructed using the 
1250-bp EcoRl/Ncol SM22a genomic subfragment (bp -1 338 to -89) and the 1 30-bp (bp -88 to 
+41) PCR™-generated genomic subfragments described herein. The p~441SM221uc plasmid 
contains the 482-bp (bp -44 1 to +4 1 ) PstllHindlU SM22cc genomic subfragment subcioned into 
BglII/HindIII-d\gested pGL2-Basic plasmid. The p-300SM221uc and p-1 62SM221uc luciferase 
reporter plasmids, respectively, contain the PCR™-generated bp -300 to +4L and -162 to +41 
SM22a genomic subfragments (including synthetic XhoJ (5' end) and HindlU (3 r end) linkers), 
subcioned into XhoI/HindJII-digestcd pGL2-Basic vector. All PCR™ -generated genomic 
subfragments were confirmed by dideoxy DNA sequence analysis. 

The following SM22a mutant promoter-luciferase reporter plasmids were generated 
and named according to the specific nuclear protein binding site (or sites) within the promoter 
that was mutated (mutated nucleotides within each nuclear protein binding site are 
underlined): 

p-441 SM22|iSME-l 5' AAGGAAGGGT TTCAGGGTCC TGCCCATAGA TCTTTTTTCC 
CGGCCGC 3' (SEQ ID NO:38); 

p-441SM22u.SME-2 5' CCGCCCTCAG CACCGCGGAT CCCCGACCCC CGCAGCATCT 
CCG 3' (SEQ ID NO:39); 

p-441SM22uSME-3 5' CTCGGATCCA TGC TAGC AAT GAATTCGGCT GCCCCCG 3' 
(SEQ IDNO:40); 

p -441SM22uSME-4 5' TCCAACTTGG TGTCTTTCCC CAAGGATCCA GCCTGTGTGG 
AGTG3' (SEQ ID NO:41); 

p-441SM22,.iSRF/SME-4 5' TCCAACTTGG TGTCTTTCCC CGGATATGGA 
GCCTGTGTGG AGTG 3' (SEQ ID NO:42); 

•p-441SM22nYYl/SME-4 5' TCCAACTTGG TGTCTTTCCC CA A ATTAGG A 
GCCTGTGTGG AGTG 3* (SEQ ID NO:43); 
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p-441SM22^iSME-5 5' GGGCAGGTAC CG AATT CAGC G 3' (SEQ ID NO:44); 
p-441SM22|iCREB/SME-6 5' GGACGGCAGA GGGGATCCAT GCCTGCCTAG 
GCGGCCG 3' (SEQ ID NO:45); 

p-441SM22^iYYl/SME-6 5' GGACGGCAGA GGGGATCCAT CACTGCCTAG 

GCGGCCG 3' (SEQ ID NO:46). 

In addition, several SM22a promoter-luciferase reporter plasmids were subcloned that 
contain mutations in two c/s-acting sequences in the SM22ce promoter sequence, p- 
441SM22fiCArG contains the mutations described above in the SME-1 and SME-4 sites, and 
p-441SM22fiSME2/5 contains the mutations described above in the SME-2 and SME-5 sites. 
Each PCR™-generated SM22a promoter mutant was confirmed by DNA sequence analyses 
as described previously (Parmacek et at., 1992). 

To identify functionally important c/s-acting elements that control the expression of 
the SM22a gene in vivo, four transgenic vectors were cloned each of which encodes the 
bacterial lacZ reporter gene under the transcriptional control of the native or mutated SM22a 
promoter fragments. The p-5000SM22-focZ, p-441 SM22-lacZ plasmid, the p- 
441SM22fiCArG-/acZ, and p-280SM22-/acZ plasmids, contain the 5-kb SM22a promoter, 
the 441 -bp SM22a promoter, the 441 -bp SM22a promoter with mutations in SME-1 and 
SME-4 (that abolish binding of SRF), and the 280-bp SM22a promoter, respectively, 
subcloned immediately 5' of the bacterial lacZ reporter gene in a modified pBluescript IIKS 
(Stratagene) plasmid. 
Transfections and Luciferase assays 

1 x 10 6 passage three primary rat aortic SMCs, C2C12 myotubes and A7r5 cells, 
respectively, were split and plated 24 hours prior to transfection and transfected with either 50 
or 100 fig of Lipofectin reagent (Life Technologies, Gaithersburg, MD), 15 fig of luciferase 
reporter plasmid and 5 fig of the pMSVJJgal reference plasmid as described previous!) 
(Parmacek et aL, 1992; Ip et al. ? 1994; Solway et al, 1995. I x 10 6 NIH 3T3 or COS-7 were 
transfected with 20 |tig of Lipofectin reagent, 15 ^ig of the luciferase reporter plasmid and 5 
fig of the pMSVPgal reference plasmid as described previously (Ip et al, 1994; Forrester et 
J- Am. Coll CardioL. 1 7:758-769, 1 991). 1 x 1 0 6 Hep G2 cells were transfected using 360 
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|ig of Lipofectarnine reagent (Life Technologies, Gaithersburg, MD), 26 jig of luciferase 
reporter plasmid and 9 \ig of the pMSVPgal reference plasmid. Following transfection, cell 
lysates were prepared, normalized for protein content and luciferase and p-galactosidase 
assays were performed as described previously (Parmacek et ai 9 1992). All studies were 
5 repeated at least three times to assure reproducibility and permit the calculation of standard 
errors. Luciferase activities (light units) were corrected for variations in transfection 
efficiencies as determined by assaying cell extracts for P-galactosidase activities. Data are 
expressed as normalized light units + S.E.M. 

Transgenic mice 

10 Transgenic mice were produced harboring the p-5000SM22-/acZ, p-44 1 SM22-/acZ. 

p-441SM22jaCArG-/tfcZ and p-280SM22-/acZ transgenes according to standard techniques 
as described previously (Metzger et al, Proc. Natl. Acad. USA, 90:9036, 1993). To identify 
transgenic founder mice, Southern blot analysis was performed using the radiolabeled lacZ 
probe and high molecular weight DNA prepared from tail biopsies of each potential founder. 

15 The number of copies per cell were quantitated by comparing the hybridization signal 
intensity (DPM) to standards corresponding to 1, 10 and 100 copies/cell using a Molecular 
Dynamics Phosphorlmager™. At least four independent founder lines containing each 
transgene were identified as described previously (Parmacek and Leiden, 1989). Transgenic 
embryos (less than ED 15.5) and tissue sections from adult mice were fixed, stained for P- 

20 galactosidase activity and counter-stained with hematoxylin and eosin as described previously 
(Lin et aL, Circulation, 82:2217-2221, 1990), except that 0.02% NP-40 was added to PBS 
during the fixation of whole mount embryos. In addition, to visualize the arterial system of 
mouse embryos, following staining for p-galactosidase activity, embryos were dehydrated in 
methanol for 24 h and cleared in 2:1 (V/V) benzyl benzoate:benzyl alcohol for 2 h. 



25 
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Example 1 
Isolation and Structural Characterization 
of the Murine SM22a cDNA 

5 Murine SM22a cDNA clones were isolated using the polymerase chain reaction in 

conjunction with synthetic oligonucleotide primers derived from the previously published 
sequence of the rat SM22a cDNA (Nishida et al y 1 993). The nucleotide sequence of the full- 
length murine SM22a cDNA is designated herein as SEQ ID NO:8. The murine SM22a 
cDNA encodes a 201 -amino acid polypeptide, SEQ ID NO:9, with a predicted molecular 
10 mass of 22.5 kDa. It is composed of a 76-bp 5' untranslated region, a 603-bp open reading 
frame, and a 403-bp 3* untranslated region. Twenty three base pairs 5' of the poly(A) tail 
there is an A/T rich sequence (AATATA) which may function as the polyadenylation signal. 

A comparison of the coding sequences of the murine and human SM22oc cDNAs 
(Shanahan et al ,1993) demonstrated that the two sequences are 91% and 97% identical at the 

15 nucleotide and amino acid levels, respectively. In addition, a comparison of the coding 
sequences of the murine SM22a cDNA and the murine smooth muscle thin filament 
regulatory protein, calponin (Strasser et al, Genbank Direct Submission Accession Number 
Z 19542, 1992), demonstrated that these two sequences are 23% identical and 32% conserved 
at the amino acid level. Interestingly, the protein sequence encoded by the murine SM22a 

20 cDNA exhibits partial sequence identity with the sequence of the Drosophila muscle protein 
mp20 (Lees-Miller et al 9 J. Biol Chem., 262:2988-2993, 1987) across the entire cDNA. 
suggesting that these two proteins may have evolved from a common ancestral gene. Two 
domains were particularly well conserved between these proteins. One domain with 14/19 
amino acid identity (corresponding to amino acids 104-122 of the murine SM22a protein) 

25 may represent a calcium binding domain oriented in an EF hand conformation (Kretsinger. 
CRC Crit. Rev. Biochem., 8:1 19-174, 1980). The second C-terminal conserved domain with 
13/24 amino acid identity (corresponding to amino acids 158-181 of the murine SM22a 
protein) is a domain of unknown function. 
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SM22a Is Encoded by a Single Copy Gene 

The finding of a putative calcium binding domain oriented in an EF hand 
conformation suggested that SM22a might be related to other members of the troponin C 
supergene family of intracellular calcium binding proteins including slow/cardiac troponin C, 
fast skeletal troponin C, calmodulin, myosin light chain and parvalbumin (Kretsinger. 1980). 
In order to determine whether SM22a is encoded by a single copy gene in the murine genome 
and whether SM22a is related to other troponin C supergene family members, the murine 
SM22ot cDNA was used to probe Southern blots containing murine genomic DNA under both 
high and low stringency conditions. Under high stringency conditions, the murine SM22a 
cDNA probe hybridized to one or two BamHJ, EcoRI, Hindi]/, PstI and Xbal bands, 
suggesting that SM22a is a single copy gene in the murine genome. Interestingly, no 
additional bands were demonstrated under low stringency conditions, suggesting that 
although the SM22a gene may have one EF hand calcium binding domain, it is not closely 
related to other members of troponin C supergene family. 

Example 2 
Expression of the SM22ot Gene 

Previous studies have suggested that SM22oc protein is expressed solely in smooth 
muscle-containing tissues of the adult and may be one of the earliest markers of the smooth 
muscle cell lineage (Gimona et aL, Eur. J. Biochem., 205:1067-1075, 1992; Duband et aL. 
Differentiation, 55 (1):1-1 1, 1993; Nishida et al, 1993). To determine the in vivo pattern of 
SM22a gene expression, the SM22a cDNA was hybridized to Northern blots containing 
RNAs prepared from 12-week old murine tissues. The murine SM22ct cDNA probe 
hybridized to one predominant mRNA species of approximately 1.2-kb. SM22a mRNA is 
expressed at high levels in the smooth muscle-containing tissues of aorta, small intestine, 
lung, spleen and uterus. In addition, prolonged autoradiographic exposures revealed very 
low, but detectable, levels of SM22cc mRNA in heart, kidney, skeletal muscle and thymus. 

In order to determine the cell-specificity of SM22a gene expression, the SM22a 
cDNA probe was hybridized to northern blots containing RNAs prepared from rat aortic 
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vascular SMCs, the rat SMC line A7r5, murine NIH 3T3 and C3H10T1/2 fibroblasts, the 
SV40-transformed monkey kidney cell line COS-7, murine C2C12 myoblasts and myotubes, 
the human hepatocellular carcinoma cell line Hep G2 and the murine lymphoid cell line EL4. 
High levels of SM22a mRNA were detected in primary rat aortic vascular SMCs and the 
smooth muscle cell line A7r5. Detection of a second 1.5 kb species of mRNA represents 
cross hybridization of the SM22a probe to the murine calponin mRNA. In addition, SM22ct 
mRNA was expressed in both undifferentiated C2C12 myoblasts and terminally- 
differentiated C2C12 myotubes. Finally, a faint hybridization signal was detectable in NIH 
3T3, C3H10T1/2, and Hep G2 cells after a 3-day autoradiographic exposure. Quantitative 
Phosphorlmager™ analysis of these low level hybridization signals revealed that SM22ct 
mRNA is expressed in these three non-myogenic cell lines at less than 1.5% the intensity of 
SM22ct gene expression in A7r5 and primary SMCs. Thus, in addition to primary SMCs and 
SMC lines, SM22ct mRNA is expressed in other embryonic skeletal muscle cell lineages such 
as C2C12 myoblasts and myotubes, but not in other non-myogenic cell lineages. 

SM22cx Is Expressed in Both Cell Cycle Arrested and Proliferating SMCs 

Within the tunica media of the arterial wall the vast majority of vascular SMCs are 
maintained in a non-proliferating, quiescent state and express contractile proteins (Owens et 
al, 1986; Rovner et al, 1986; Taubman et al, 1987; Ueki et al, 1987; Gimona et al, 1990; 
Shanahan et al, 1993; Ross, Nature, 362:801, 1993; Forrester et al, 1991). However, in 
response to vascular injury, SMCs migrate from the tunica media to the intimal layer, 
proliferate and assume a "synthetic phenotype" (Ross, 1986; Schwartz et al, 1986; Zanellato 
et al, 1990; Ross, 1993; Forrester et al, 1991; Schwartz et al, 1992; Liu et al, 1989). 
Previous studies have demonstrated that many genes encoding vascular SMC contractile 
proteins are down-regulated during this process (Owens et al, 1986; Rovner et al, 1986; 
Ueki et al, 1987; Gabbiani et al, Proc. Natl. Acad. Sci. USA 78:298, 1981). Thus, the 
SM22a gene may be unique in that its expression is not differentially regulated during 
progression through the cell cycle. In order to address this question, cultures of low passage 
number primary rat aortic SMCs were synchronized in the G 0 /G, stage of the cell cycle by 
serum starvation for 72 hrs. FACS analyses revealed that under these conditions 
approximately 90% of cells are arrested in G 0 /G, (Chang et al, 1995a). The cells were then 
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serum-stimulated and RNA was prepared from replicate cultures at the time of serum 
stimulation (t 0 ), and at 8 hrs, 12 hrs, 16 hrs, and 24 hrs post-stimulation. After serum 
stimulation, the arrested vascular SMCs begin to pass through the G,/S checkpoint of the cell 
cycle at approximately 12 hrs and by 24 hrs post-stimulation greater than 50% of cells are in 
the S and G 2 /M phases of the cell cycle (Chang et al, 1995a). A northern blot analysis 
demonstrated no differences in SM22a gene expression in cell cycle arrested versus 
proliferating SMCs as assessed by quantitative Phosphorolmager™ analysis of the 
hybridization signal. Thus, in contrast to other smooth muscle contractile proteins, such as 
smooth muscle myosin heavy chain (Rovner et al. y 1986), smooth muscle ct-actin (Owens et 
al. y 1986) and calponin, SM22a appears to be constitutively expressed at high levels in both 
quiescent and proliferating vascular SMCs. 

Example 3 

Isolation and Structural Characterization of 
a SM22a Genomic Clone 

A full length murine SM22a genomic clone of 20-kb was isolated by screening a 
murine 129SV genomic library with a SM22a cDNA probe under high stringency conditions. 
Exons were identified by hybridization with specific cDNA fragments and their boundaries 
confirmed by DNA sequence analysis. The nucleic acid sequence of the genomic clone is 
designated herein as SEQ ID NO:l, containing exon 1, SEQ ID NO:2, containing exons 2, 3 
and 4, and SEQ ID NO:6, containing exon 5. There is approximately a 4 kb gap between 
SEQ ID NO:l and SEQ ID NO:2, and approximately a 450 base gap between SEQ ID NO:2 
and SEQ ID NO:6. The amino acid sequences are encoded by exons 2, 3 and 4 and are 
designated herein as SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:7. The 
murine SM22a gene is composed of five exons spanning 6.2-kb of genomic DNA. 

The transcriptional start site of the SM22a gene was identified by RNase protection, 
primer extension and 5' RACE PCR™ analyses. Primer extension analyses utilizing an 
antisense synthetic oligonucleotide corresponding to bp 80-104 of the SM22a cDNA resulted 
in a major extended product of 104-bp (arrow) which was generated at reaction temperatures 
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up to 56°C. In addition, 5' RACE PCR™ was performed utilizing an antisense 
oligonucleotide primer corresponding to bp 234-258 of the SM22a cDNA. DNA sequence 
analyses of eight random 5' RACE clones revealed a transcriptional start site 76-bp 5' of the 
initiation codon in seven of eight clones and 72-bp 5' of the initiation codon in one of eight 
clones. RNase protection analyses were also performed using an antisense cDNA probe 
corresponding to bp -88 - +44 of the SM22a genomic sequence as deduced by DNA 
sequence and Southern blot analyses. These analyses revealed a major protected fragment of 
44-bp (arrow) corresponding to a transcriptional start site 76-bp 5' of the initiation codon. In 
addition, a second, minor (20% relative signal intensity) protected fragment of 54-bp was 
also demonstrated. Taken together, these data allowed the identification of the major 
transcriptional start site of the murine SM22ct gene 76-bp 5' of the initiation codon. 

The complete coding sequence and 1339-bp of 5* flanking sequence of the SM22a 
gene was determined and each of the splice junctions conforms to the consensus splice donor- 
acceptor patterns as described by Breathnach and Chambon (Breathnach et aL, Annu. Rev. 
Biochem, 50, 349-383, 1981). In order to identify potential transcriptional regulatory 
elements, 1339-bp of 5* sequence flanking the cap site was searched for a variety of 
transcriptional regulatory elements using MacVector DNA sequencing software (Kodak/IBI). 
The nucleotide sequence TTTAAA, which might function as a TATA box was present 29-bp 
5' of the start site. A consensus CAAT box was not identified in the immediate 5' flanking 
region of the SM22ct gene. A computer homology search for previously described muscle- 
specific and/or skeletal or cardiac muscle lineage-restricted transcriptional regulatory 
elements revealed five consensus E boxes/bHLH myogenic transcription factor binding sites 
(CANNTG [Olson, Genes Dev., 4, 1454-1461, 1990; Tapscott et al, J. Clin. Invest., 87:1133- 
1138, 1991; Lassar et ai 9 Cell, 58 (5):823-31, 1989]) located at bps -534, -577, -865, -898, - 
910, and -1267, three consensus GATA-4 binding sites (WGATAR [Evans et al, Proc. Natl. 
Acad. Sci. (USA), 85:5976-5980, 1988]) located at bps -504, -828, -976, and two AT-rich, 
potential MEF-2/rSRF binding sites (YTAWAAATAR, SEQ ID NO: 13 [Gossett et al., Mol 
Cell. Bio/.. 9:5022-5033, 1989]) located at bps -407 (TTtAAAATcG, SEQ ID NO:14, small 
letters denote mismatches from the consensus MEF-2 sequence) and -770 (TTcAAAATAG, 
SEQ ID NO: 15). In addition, functionally important nuclear protein binding sites which have 
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been identified in previously characterized skeletal and cardiac-specific transcriptional 
regulatory elements included two consensus CArG/SRF binding sites (Minty el ai, MoL Cell. 
Biol. 6:2125, 1986) located at bps -150 and -273 and one CACC box (Dierks et ai y Cell, 
32:695-706, 1983) located at bp -104. Finally, four AP2 (CCCMNSSS, SEQ ID NO:16 
5 [Mitchell et aL, Cell, 50:847-851, 1987]), one Spl (KRGGCKRRK, SEQ ID NO: 17 [Dynan 
et aL 9 Cell, 35:79-87, 1983]), and two NF-IL6 (TKNNGNAAK, SEQ ID NO:18 [Akira et al. 9 
EMBOJ, 9 (6): 1897-906, Cell, 35:79-87, 1990]) binding sites were located in the 5' flanking 
region. 

1 0 Example 4 

Identification of the cis~ Acting Transcriptional Regulatory 
Elements That Control SM22a Gene Expression 

In order to identify the functionally important c/s-acting sequences that regulate 
15 transcription of the SM22a gene in SMCs, a series of transient transfections were performed 
using SM22a-luciferase reporter constructs and primary rat aortic vascular SMCs and the 
SMC line, A7r5, both of which express high levels of SM22oc mRNA. Transfection of A7r5 
cells with the plasmid p-5000/HSM221uc, containing 5-kb of 5' flanking sequence and the 
entire 4-kb SM22a intron 1 sequence (the initiation codon is located in exon 2) ? resulted in a 

20 250-300-fold induction in luciferase activity as compared to the promoterless control plasmid, 
pGL2-Basic. This level of transcriptional activity was comparable to that obtained following 
transfection of A7r5 cells with the RSV-containing luciferase reporter plasmid, pRSVL. In 
order to determine whether this transcriptional activity was due to the immediate 5' flanking 
region of the SM22a gene, or alternatively, was due to a transcriptional regulatory element 

25 located within the first intron of the SM22a gene, the activities of the p-5000/HSM221uc and 
p-5000SM221uc plasmid were compared. Transfection of A7r5 cells with the p- 
5000SM221uc plasmid, containing only 5-kb of 5' flanking sequence, resulted in high-level 
transcription of the luciferase reporter gene comparable (on a molar basis) to levels obtained 
with the p-5000/1 1 SM221uc plasmid. Thus, the 5' flanking region of the SM22a gene 

30 contains c/.y-acting sequence elements required for high-level transcription in A7r5 cells. 
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To further localize the 5' flanking elements of the SM22a gene that direct high-level 
expression in SMCs, a series of 5' deletion mutants were transfected into both A7r5 cells and 
primary cultured rat aortic vascular smooth muscle cells. In both A7r5 cells and primary 
vascular SMCs, the p-441SM221uc plasmid, containing 441 -bp of 5' flanking sequence, 
5 increased transcription of the luciferase reporter to levels comparable to the p-5000SM221uc 
plasmid and the p-1338SM221uc plasmids. However, transfection of both A7r5 cells and 
primary vascular SMCs with the luciferase reporter plasmids p-300SM221uc and p- 
162SM221uc containing 300-bp and 162 -bp, respectively, of 5' flanking sequence resulted in 
50% and 90% reductions in normalized luciferase activities as compared with those obtained 
10 with the p-441SM221uc. These data demonstrated that 441 -bp of SM22a 5* flanking 
sequence, containing the endogenous SM22oc promoter, is sufficient to direct high-level 
transcriptional activity in both A7r5 cells and primary rat aortic SMCs. 

Example 5 

1 5 Cellular-Specificityof the SM22a Promoter 

In order to characterize the cellular-specificity of the SM22a promoter sequence, the 
transcriptional activities of the 441 -bp SM22a promoter containing plasmid, p-44 1 SM221uc, 
was compared to the positive control plasmid containing the rous sarcoma virus LTR, 

20 pRSVL, in primary rat vascular SMCs, the smooth muscle cell line A7r5, NIH 3T3 
fibroblasts, COS-7, and Hep G2 cells. Consistent with the lineage-restricted pattern of 
SM22a mRNA expression demonstrated in these cell lines, the promoter-containing plasmid, 
p-441SM221uc, was active in primary rat aortic SMCs and A7r5 cells, increasing 
transcription of the luciferase reporter gene approximately 2500-fold and 540-fold, 

25 respectively, over that induced by transfection with the promoterless pGL2-Basic plasmid 
(FIG. 1). This level of promoter activity was comparable to levels obtained following 
transfection of these cells with the RSV LTR-driven positive control plasmid (FIG. 1). In 
contrast, the 441 -bp SM22ct promoter was inactive in NIH 3T3, COS-7 and Hep G2 cells 
(FIG. 1). 
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DNA sequence analyses revealed that this 441 -bp promoter contains two CArG/SRF 
boxes (Minty et al , 1986), a CACC box (Dierks et al, 1983), and one A/T-rich, potential 
MEF-2/rSRF binding site (Gossett et al, 1989), c/.v-acting elements which have each been 
demonstrated to be involved in the transcriptional programs that regulate skeletal and cardiac 
muscle-specific gene expression. However, unlike most previously described skeletal 
muscle-specific transcriptional regulatory elements, this sequence lacked a canonical E box 
binding site for the myogenic bHLH transcription factors (Tapscott et al, 1991 ; Lassar et al, 
1989). Thus, the endogenous 441 -bp SM22a promoter contains all of the exacting sequence 
elements required to recapitulate the smooth muscle lineage-restricted pattern of SM22a gene 
expression demonstrated in vivo. 

Example 6 

Generation of SM22a~Pgal Transgenic Mice 

A reporter construct was first prepared in which the 441 -bp minimal SM22ct promoter 
was subcloned immediately 5' of the bacterial p-galactosidase reporter gene (lacZ). The 
transgenic vector was generated from a pBluescript-KS phagemid containing Ascl restriction 
sites flanking the polylinker sequence. This construct is referred to herein as -44 1 SM22focZ. 
The transgene was microinjected into oocytes that were transplanted into pseudo-pregnant hosts 
as described in Metzger et al, 1993 (incorporated herein by reference). To identify transgenic 
founder mice, Southern blot analysis was performed using the radiolabeled lacZ probe and high 
molecular weight DNA prepared from tail snips of each potential founder pup. The 
radiolabeled lacZ cDNA probe hybridized to the expected 4.2 kb BamHJ-digested band in 4 of 
17 pups analyzed. The four founders contained between 5 and 160 copies per cell as assessed 
by comparing the hybridization signal intensity (DPM) to standards corresponding to 1 , 1 0 and 
1 00 copies per cell using a Molecular Dynamics Phosphorlmager™. 

The Fl -441SM22/acZ#14 male was crossed with a CD-I female and El 1.5 embryos 
from this litter were isolated, genotyped (using PCR™) r fixed and stained for p-galactosidase 
activity. Transgenic embryos were easily distinguished from their non-transgenic litter mates 
by the obvious blue staining along their distal somites. This pattern correlated with the 
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transient pattern of SM22a gene expression observed in the developing somites. In EDI 1.5 
embryos, the endogenous SM22a gene is expressed throughout the primitive heart tube, 
developing somites, dorsal aorta and the forming branch arteries (Li et aL. 1996a). Whole 
mount staining of EDI 1.5 embryos demonstrated high level p-galactosidase activity 
throughout the developing arterial system. Blue staining was observed throughout the dorsal 
aorta, the carotid and vertebral arteries, the cerebral arteries, the umbilical arteries and the 
aortic arches. A high power section through the iliac artery, demonstrated that expression of 
the lacZ transgene was restricted to 1-2 layers of cells underlying the arterial endothelium. In 
addition, p-galactosidase activity was detected within the myotomal component of the 
developing somites and within the bulbo-truncus region (future outflow tract) and at low 
levels within the bulbo-cordis region (future right ventricle) of the primitive heart. 
P-galactosidase activity was not detected within the future left ventricle, left atrium or right 
atrium at this stage of embryonic development. Surprisingly, although the SM22a gene is 
expressed at high levels in smooth muscle cells lining the pulmonary airways, as well as 
within the gastrointestinal and genitourinary tracts, no p-galactosidase activity was detected 
in the developing lung buds, gastrointestinal mucosa, or the uterine or bladder mucosa during 
late murine embryogenesis or postnatal development. Thus, the 441 bp SM22a promoter is 
necessary and sufficient to activate transcription in vascular SMC's in a lineage-restricted 
fashion in transgenic mice. In addition, this minimal promoter element contains c/.v-acting 
sequences required to activate transcription of the SM22a gene in the developing somites. 
These data also demonstrate that SM22a gene expression is regulated at the level of 
transcription. 

Furthermore, the normalized luciferase activity obtained with the 300-bp promoter 
was still 100-fold above that obtained with promoterless control plasmids in these transient 
transfection assays. To determine whether a 280-bp SM22a promoter fragment (bp -280 - 
+41) was sufficient to direct arterial SMC-specific gene expression, the inventors produced 
eight independent lines of transgenic mice in which the lacZ gene was placed under the 
transcriptional control of the 280-bp SM22oc promoter. These mice contained between 2 and 
34 copies of the transgene per cell. The 280-bp of 5' flanking sequence was sufficient to 
direct high level P-galactosidase activity (blue staining) to arterial SMCs and the myotomal 
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component of the somites of EDI 1.5 mice. Virtually identical patterns of transgene 
expression were demonstrated in 4 independent lines analyzed at EDI 1.5 in which copy 
numbers varied between 2 and 34 copies per cell. Interestingly, dense blue staining was 
detected within the cardiac outflow tract (a neural crest derivative) while a somewhat patchy 
5 pattern of blue staining was present in the developing arterial system (which is derived from 
lateral mesoderm and neural crest). Higher power sections confirmed that virtually every cell 
within the cardiac outflow tract stained blue. Interestingly, dense blue staining was detected 
within the mesenchymal cells that compose the aorticopulmonary spiral septum which is 
present at EDI 1.5. In addition, most, but not all, cells underlying the epithelium of the 

10 developing arteries stained blue. Taken together, these data demonstrate that the 280-bp 
SM22cc promoter is sufficient to program lineage-restricted transcription in arterial SMCs and 
the developing somites. However, in contrast to the endogenous pattern of SM22a gene 
expression, the 441 -bp (and 280-bp) SM22a promoter does not contain the c« -acting 
elements that control SM22a transcription in either visceral (gastrointestinal, uterine, bladder, 

15 and bronchial) or venous SMCs nor in the primitive heart tube. Finally, the inventors 
observed virtually the- same arterial SMC-specific pattern of expression using the 5000-bp 
SM22a promoter in transgenic mice. These data strongly suggest that distinct transcriptional 
programs distinguish tissue-restricted subsets of SMCs (even within the vasculature). 

Xgal tissue staining 

20 The lung, heart, liver, kidney, spleen, testis or ovary, and skeletal muscle are excised 

from euthanized animals, and stained to reveal p-galactosidase activity. If P-galactosidase 
activity is evident in non-transgenic mice, the transgenic lines are generated using a nuclear 
localizing P-galactosidase isoform to minimize false-positive staining (Hughes and Blau, 
1990). To reveal p-galactosidase activity, tissues are washed in PBS, then fixed in 1.25% 

25 glutaraldehyde (lung is fixed as below). After washing in Ca +2 - and Mg +2 -free buffer, tissues 
are incubated overnight in the dark in Xgal solution (50 mM Tris HC1 pH 7.5, 2.5 mM 
potassium ferriferrocyanide, 15 mM NaCL 1 mM MgCl 2 , 0.5 mg/ml Xgal), then paraffin 
embedded; 4 micron sections are counterstained with eosin. 



WO 98/15575 



PCT/US97/16204 



-42 - 

Data analysis 

The tissuq and cellular distribution of Xgal staining, reflecting SM22a promoter 
transcriptional activity, is recorded for each transgenic line studied, and compared 
qualitatively among experimental conditions. Quantitative assessment of lung and tracheal 
5 SM22a promoter transcriptional activity is also performed by RNase protection assay for 
lacZ mRNA, which is compared among study groups using ANOVA followed by multiple 
range testing. To test whether potential differences in lacZ mRNA levels might stem from 
different amounts of smooth muscle among groups, airway smooth muscle area vs. 
circumference curves is compared between groups as described by James et al (1989): 
10 pulmonary arterial area vs. circumference curves are likewise compared. 

Example 7 
Expression of SM22oc in Lung 

SM22cc mRNA by is detected in the lungs by in situ hybridization. A digoxigenin- 
labeled cRNA corresponding to the reverse complement of mouse SM22a cDNA bp 644 to 
1007 was prepared by in vitro transcription (MaxiScript™ Kit, Ambion, and Genius™ 4 Kit. 
Boehringer Mannheim). In situ hybridization was performed on a lung specimen obtained at 
autopsy from a patient without lung disease. Hybridized probe is detected 
immunohistochemically with an anti-dioxigenin antibody linked to alkaline phosphatase. The 
SM22ct cRNA binds selectively to airway smooth muscle and to pulmonary vascular smooth 
muscle; black anthracotic pigment was also evident in this specimen (typical of urban 
dwellers). 

25 Example 8 

Adenovirus Mediated Expression of a Heterologous Gene Product in vitro 

The 441 -bp murine SM22a promoter has been shown previously to program arterial 
SMC-specific gene expression in transgenic mice (Kim et ai, 1997; Li et aL, 1996b: 
30 Moessler et ai, 1996). To test whether the SM22a promoter could be utilized to restrict the 
expression of a recombinant gene product encoded by a RDAd to SMCs, a RDAd (AdSM22- 
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lacZ) was constructed containing the bacterial lacZ reporter gene under the transcriptional 
control of the murine SM22ot promoter (FIG. 3A, upper panel). In the studies described here, 
the activity of AdSM22-lacZ was compared to that of the control virus, AdCMV-lacZ 
(Tripathy et al, Proc. Natl Acad. ScL USA, 91:11557-11561, 1994), in which the bacterial 
5 lacZ reporter gene is under the transcriptional control of the ubiquitously active 
cytomegalovirus (CMV) immediate early gene promoter/enhancer (FIG. 3A. lower panel). 

To assess the activity of AdSM22-lacZ in cells transduced in vitro, replicate cultures 
of primary rat aortic SMCs were infected with 1-, 10- and 100-plaque forming units 
(PFU)/cell of either AdSM22-lacZ or AdCMV-lacZ and the fraction of cells expressing 

10 histochemically identifiable P-galactosidase activity (as assessed by blue staining with Xgal) 
was quantitated. As shown in FIG. 3B, 12, 80, and 88% of cells expressed the lacZ transgene 
following infection with I-, 10- and 1 00-PFU/ceI 1 , respectively, of AdSM22~lacZ. The 
fraction of cells expressing P-galactosidase was comparable to that observed following 
infection of replicate cultures with the control AdCMV-lacZ virus (FIG. 3B). Consistent with 

15 these findings, 10, 70 and 90%, respectively, of immortalized A7r5 vascular SMCs expressed 
the lacZ transgene following infection with 1-, 10-and 100-PFU/cell of AdSM22-lacZ. This 
efficiency of transgene expression was, again, comparable to that observed following 
infection of this immortalized SMC line with the AdCMV-lacZ control virus. 

To determine whether the SM22a promoter restricted expression of the lacZ reporter 

20 gene to SMCs, primary human umbilical vein endothelial cells (HUVECs) and NIH 3T3 
fibroblasts were infected with AdSM22-IacZ or the AdCMV-lacZ control virus. In contrast to 
the high efficiency of transgene expression observed following AdSM22-lacZ-mediated 
infection of primary and immortalized SMCs (FIG. 3B), p-galactosidase activity was not 
detectable in HUVECs or NIH 3T3 cells following infection with AdSM22-lacZ (FIG. 3B). 

25 In contrast, 10, 60, and 93% of HUVECs expressed histochemically detectable p- 
galactosidase activity following infection with 1-, 10-, and 100-PFU, respectively, of the 
AdCMV-lacZ control virus (FIG. 3B). Similarly, approximately 50% of NIH 3T3 cells 
expressed-detectable P-galactosidase activity following infection with 100-PFU/cell of the 
AdCMV-lacZ control virus. 
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Southern blot analyses of DNA harvested from HUVECs infected 72-h previously 
with AdSM22-lacZ demonstrated the presence of the lacZ transgene in these cells. The 
hybridization signal was comparable in intensity to that obtained with DNA harvested from 
HUVECs infected with the AdCMV-lacZ control virus thereby confirming efficient infection 
of these cells by AdSM22-lacZ. The different sizes of the lacZ hybridizing bands seen in this 
study is consistent with the expected patterns of restriction endonuclease digestion of each 
adenoviral vector with Bglll. Despite the fact that AdSM22-lacZ and AdCMV-lacZ both 
efficiently infected HUVECs, no lacZ transgene mRNA was detected in the AdSM22-lacZ 
infected HUVECs by Northern blot analysis. In contrast, HUVECs infected with the control 
AdCMV-lacZ virus expressed abundant lacZ mRNA. Taken together, these data 
demonstrated that AdSM22-lacZ programs SMC-specific transgene expression in vitro and 
confirmed that the lineage-restricted expression of the transgene was regulated at a 
transcriptional or post-transcriptional level. 

The pAdSM22 plasmid was generated by subcloning the 441 -bp murine SM22a 
promoter (Solway et al 7 1995) into Clal (5' end)/HindIII (3* end)-digested pAdEFl (KN) 
plasmid (Tripathy et ciL, 1994). The pAdSM22-lacZ plasmid was generated by subcloning the 
Hindlll (5' end)/BgIII (3' end)-linkered bacterial lacZ reporter gene into the Hindlll/BamHI- 
digested pAdSM22 plasmid. The AdSM22-lacZ adenovirus encoding the bacterial lacZ 
reporter gene under the transcriptional control of the murine SM22a promoter and the human 
4F2 heavy chain transcriptional enhancer (Karpinski et al. 9 Mol. Cell. Biol, 9:2588-2597 ? 
1989) was generated by recombination in 293 cells between the pAdSM22-lacZ plasmid 
DNA and El- and E3-deIeted Ad5Sub360 genomic DNA digested with Xbal and Clal as 
described previously (Barr et al. 9 1994) The structure of this virus was confirmed by 
Southern blot analyses. The AdCMV-lacZ RDAd encoding the bacterial lacZ reporter gene 
under the transcriptional control of the cytomegalovirus (CMV) immediate early gene 
promoter/enhancer has been described previously (Barr et al. 1994). Recombinant viruses 
were plaque purified three times to avoid contamination with replication-competent virus. 
High titer adenoviral stocks were prepared by infecting 293 cells with 2- to 5-plaque forming 
units (PFU) of virus per cell as described previously (Barr et al. 1994). Titers of each cesium 
chloride purified viral stock were determined from the absorbance at 260 nm (1 absorbance 
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unit= 10'" PFU/ml) and were confirmed by plaque assay as described previously (Barr et al.* 
1994). 

The studies described in this example were performed as follows: Primary rat aortic 
SMCs were isolated from 12- to 16-wk old Sprague Dawley rats and grown as described 
5 previously (Chang et al, 1995a). Virtually all cells stain positive for expression of SM-ot- 
actin when isolated using this technique (Sol way et aL, 1995). In all experiments only third 
passage primary rat aortic SMCs were utilized. Immortalized rat vascular A7r5 SMCs, 
passage 4 human umbilical vein endothelial cells (HUVECs), and mouse NIH 3T3 fibroblasts 
were grown as described previously (Kim et aL, 1997). Cells were placed in medium 
10 containing 2% fetal bovine serum (FBS) and infected with either 1-, 10- or 100-PFU/cell of 
purified adenoviral stocks. Following infection, cells were washed in PBS and placed in 
growth medium containing 10% FBS. 72-h post-infection, cells were harvested for 
preparation of DNA and RNA, or were fixed and stained for P-galactosidase activity with X- 
gal as described (Lin et aL, 1990). The unstained and blue-stained (P-gai + ) cells from 10 

15 representative high power fields were counted in each section and the percentage of p-ga] + 
cells calculated. The data are expressed as % P-galactosidase positive cells ± S.D. All 
experiments involving animals were approved by the University of Chicago Committee on 
Animal Care and Use. The Sprague-Dawley rats were housed and cared for according to NIH 
guidelines in the A. J. Carlson Animal Research Facility at the University of Chicago. 

20 Southern and Northern blot analyses were performed as described previously 

(Parmacek and Leiden, 264:13217-13225, 1989). The polymerase chain reaction (PCR)- 
generated 485-bp bacterial lacZ probe (which corresponds to bp 962-1448 in the pCMVp 
plasmid (Clonetech)) was radiolabeled and used for the Southern and Northern blot analyses. 
Quantitative image analyses were performed using a Molecular Dynamics Phosphorlmager 

25 (Sunnyvale, CA.). 
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Example 9 

Intra-Arterial Administration of RDAd in Uninjured and 
Balloon-Injured Rat Carotid Arteries 

5 After induction of anesthesia and intubation, the left and right carotid arteries of adult 

Sprague-Dawley rats were isolated and a balloon-injury was created by dilatation with a 2F 
Fogarty catheter as described previously (Chang et ctL, 1995a). A 24-gauge intravenous 
catheter was introduced into the lumen of uninjured or balloon- injured arterial segments and 
2 X 10 9 -PFU of AdSM22-lacZ or AdCMV-lacZ was instilled into the isolated arterial 
10 segment for 5 minutes. Seven days following infection, rats were euthanized and the isolated 
segments of carotid artery were removed, fixed in 1 .25% glutaraldehyde, and stained for P- 
galactosidase activity with X-gal as described previously (Lin et al, 1990). Photomicroscopy 
was performed using Kodak EPT 160 film and a Zeiss Axiophot microscope. 

To determine whether the SM22a promoter could be used to restrict adenovirus- 
15 mediated transgene expression to arterial SMCs in vivo, 2 X 10 9 PFU of either AdSM22-lacZ 
or the control AdCMV-lacZ virus were introduced into isolated segments of uninjured and 
balloon-injured rat carotid arteries. Diffuse blue staining of the vascular endothelium was 
observed seven days following administration of the control AdCMV-lacZ virus into the 
uninjured rat carotid artery. In addition, rare cells within the adventitia also stained blue. In 

20 contrast, when AdSM22-lacZ was introduced into the uninjured rat carotid artery, P~ 
galactosidase activity was not observed within either endothelial or adventitial cells. 
However, rare lacZ-expressing SMCs were observed in the superficial (abluminal) layer of 
the tunica media. These data suggested that the SMC-specificity of AdSM22-lacZ transgene 
expression is maintained following intra-arterial administration of AdSM22-lacZ into an 

25 isolated segment of the uninjured rat carotid artery. 

To determine the cell-specificity of transgene expression in balloon-injured rat carotid 
arteries, 2 X 10 9 -PFU of AdSM22-lacZ was instilled into an isolated segment of the rat 
carotid artery for 5 minutes immediately following balloon injury. Seven days post-infection, 
the injured arterial segments were isolated and the pattern of P-galactosidase expression was 

30 compared to that observed in the uninfected balloon-injured contra lateral artery. In contrast 
to the low level P-galactosidase activity observed in the uninjured carotid artery infected with 
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the AdSM22-lacZ virus, higher efficiency gene transfer was achieved in these balloon-injured 
arterial segments. The majority of the SMCs expressing (i-galactosidase activity were located 
within the tunica media. In addition, rare cells within the neointima also stained light blue. 
Consistent with previous reports, gene transfer was preferentially observed in the SMCs 
underlying the site of neointimal proliferation. Finally, lacZ transgene expression was not 
observed in endothelial cells at the margins of the vessel wall injury, where endothelial cells 
remained intact. Taken together, these data demonstrated that the AdSM22-IacZ virus 
maintains its SMC-specific pattern of transgene expression following intra-arterial 
administration and that it can be used to efficiently transduce arterial SMCs in the balloon- 
injured rat carotid artery. 

Example 10 
Intravenous Administration of RDAd 

Intravenous administration of RDAd results in high level gene transfer to the liver and 
lung thereby potentially limiting the utility of these viruses in some clinical settings (Kashyap 
et al., 1995; Johns et ah, 1995; Miller and Vile, 1995). 12-16 week old Sprague-Dawley rats 
were injected intravenously with 10 9 - or 10 ,0 -PFU of AdSM22-lacZ or AdCMV-lacZ, 
respectively. Liver function tests were performed on serum samples obtained 7 days 
following infection using Kodak DT60II and DTSCH automated analyzers. To determine the 
significance of alterations in liver function tests observed between control, AdSM22-lacZ- 
infected. and AdCMV-lacZ-infected rats, Student's t tests were performed. 7-days post- 
injection, rats were euthanized and the injected tissue, as well as the liver, lung, kidney, and 
carotid arteries were isolated, washed, fixed and stained for p-galactosidase activity with X- 
gal as described previously (Lin et aL, 1990). 

LacZ expression was observed throughout the livers of rats infected with the AdCMV- 
lacZ control virus. In addition, focal patches of p-gal+ cells were observed within 
perivascular regions of the lung of AdCMV-lacZ-infected rats. In contrast, seven days after 
infection with AdSM22-lacZ, histological sections of both the liver and lung were 
indistinguishable from those obtained from uninfected control rats. These data suggest that in 
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contrast to RDAd containing virally-driven and/or ubiquitously active transcriptional 
regulatory elements. AdSM22-lacZ restricts transgene expression to SMCs following 
intravenous administration. 

To determine whether intravenous administration of AdSM22-lacZ caused 
abnormalities in liver function despite the finding that the lacZ reporter gene encoded by this 
virus was not expressed in this tissue, adult Sprague Dawley rats were injected intravenously 
(IV) with -10 ,0 -PFU of the AdSM22-lacZ virus and liver function tests were performed on 
serum samples obtained seven-days post-infection. No statistically significant elevations in 
serum alkaline phosphatase (AP), alanine aminotransferase (ALT), aspartate aminotransferase 
(AST),y-glutamyltranspeptidase (GGT), total bilirubin, total protein and albumin were 
observed in rats infected with the AdSM22-lacZ virus (see Table 1) (p > 0.05). However, 
small, but consistent, elevations in the mean serum concentrations of ALT, AST, AP were 
observed. In contrast, statistically significant elevations in ALT and AST serum 
concentrations were observed seven days following intravenous administration of 10 I0 -PFU 
of the AdCMV-IacZ control virus (p < 0.05). In addition, increased serum concentrations of 
AP and bilirubin were observed in rats receiving 1 X 10 l0 -PFU of the AdCMV-lacZ virus (p 
< 0.09). Thus, intravenous administration of high doses of AdSM22-lacZ did result in mild 
elevations in liver function tests. However, the liver function test abnormalities were 
significantly less marked than those observed in rats infected with identical doses of the 
AdCMV-lacZ control virus. 



Table 1 



Control 



10 ,0 SM22-lacZ 



10 ,0 CMV-lacZ 153+16* 



ALT 



61±4 



AST 
9718 
137±21 
166±21* 



AP 
229±38 
306±37 



Bill 
0.110.0 
0.110.0 



4421103** 0.310.2** 



Alb 

2.910.1 
3.710.2 
3.310.2 



Data are expressed as mean 1S.E.M. 
*p<0.05 versus control values 
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**p<0.09 versus control values 

Example 1 1 

Direct Injection of RDAd into Visceral SMCs and Skeletal Muscle 

Direct injection of AdSM22-lacZ into SMC-containing tissues and skeletal muscle 
was performed following induction of anesthesia and intubation as described above. 10 9 -PFU 
of the AdSM22-IacZ virus was injected directly with a 30-gauge needle into the wall of the 
ureter, the bladder wall or intramuscularly. The site of each injection was marked by a suture. 
Seven days after injection, the sites of injection were isolated, fixed and stained for p- 
galactosidase activity as described (Lin et al., 1 990). 

Dense blue staining was observed throughout the longitudinal and circumferential 
layers of SMCs within the wall of the ureter. In contrast, P-galactosidase activity was not 
observed within the epithelial cells lining the lumen of the ureter. Following direct injection 
of AdSM22-lacZ into the bladder mucosa, focal patches of P~gal+ SMCs were observed 
surrounding the site of injection. In contrast, P-galactosidase activity was not observed within 
the bladder epithelium. These data demonstrated that AdSM22-lacZ programs transgene- 
expression in visceral, as well as vascular, SMCs. 

The 441 -bp murine SM22cc promoter is active in embryonic skeletal muscle cells and 
the somites of transgenic mice (Kim et al., 1997). To determine whether AdSM22-lacZ 
programs transgene expression in adult skeletal muscle in vivo, 10 9 -PFU of the AdSM22-IacZ 
virus was injected intramuscularly into the rat rectus abdominus and quadriceps muscles. In 
contrast to the dense blue staining observed in visceral SMCs following direct injection into 
the wall of the ureter and bladder, P-galactosidase activity was not observed in either the 
rectus abdominus or quadriceps muscles. Thus, the lacZ reporter gene encoded by AdSM22- 
lacZ is expressed exclusively in visceral and vascular SMCs when administered intra- 
arterially, intravenously, or intra-muscularly. 
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Example 12 

Adenovirus Mediated Expression of a Cell Cycle Control Gene 

The Rb protein inhibits cell cycle progression in many mammalian cell types 
5 (Hollingsworth et al, Curr. Opin. Genet. Dev., 3:55, 1993), and has been shown to be an 
important regulator of vascular smooth muscle proliferation (Chang et ai, 1995a). In its 
unphosphorylated state, the Rb gene product binds and inactivates certain cellular 
transcription factors that are required for cell cycle progression (Chen et at., Cell, 58:1193, 
1989) and upon phosphorylation, the transcription factors are released and the cell progresses 

10 through the proliferation cycle. A gene encoding a phosphorylation deficient Rb gene 
product has been constructed and shown to constitutively inhibit smooth muscle cell cycle 
proliferation (Chang et aL, 1995a) when transfected into rat aortic smooth muscle cells in a 
replication defective adenovirus vector. Further, the Chang reference also shows that 
replication defective adenovirus vectors can be used to express heterologous genes in rat 

15 carotid arteries in vivo upon direct exposure of isolated segments of injured artery to the 
adenovirus. A similar study was done in isolated porcine arteries and again the adenoviral 
transferred constitutive Rb gene product was shown to be expressed and to inhibit smooth 
muscle cell proliferation. 

In a prophetic example of the present invention, this phosphorylation deficient Rb 
20 gene product may also be expressed under the control of the smooth muscle specific promoter 
contained in an adenovirus vector as disclosed herein, thus directing expression of the Rb 
gene product specifically in smooth muscle cells. Such administration is contemplated to 
arrest smooth muscle cell proliferation when the described vector expressing Rb from the 
SM22oc promoter is administered to an animal or human subject as described in the previous 
25 examples, particularly following arterial balloon injury. This method of preventing restenosis 
or other smooth muscle cell proliferative disorders offers the advantage of administration of 
the virus vector by a less invasive method such as intravenous injection. It is also 
contemplated that other cell cycle control gene products, such as p53 for example, would be 
effective in this method of preventing restenosis. 

30 
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Example 13 
Identificationof Smooth Muscle Specific 
Trans- Acting Transcription Factors 

Identification of nuclear protein binding sites in the SM22a promoter 

To identify nuclear protein binding sites within the 44 1 -bp SM22a promoter, DNase I 
footprint analyses were performed. Three overlapping genomic subfragments (bp -441 to - 
256, bp -256 to -89, and bp -89 to +41) spanning the 482-bp (bp -441 to +42) SM22cx 
promoter were subjected to DNase I footprint analyses using nuclear extracts from the SMC 
line, A7r5 (which express high levels of SM22a mRNA) and NIH 3T3 cells. The sense and 
antisense strands of the three genomic subfragments were end-labeled and incubated in the 
absence (control) or presence of A7r5 and NIH 3T3 (3T3) of nuclear extracts before partial 
digestion with DNase I (concentrations varied from 5 U/ml to 22.5 U/ml). Standard Maxam 
and Gilbert purine (G + A) sequencing reactions were run in parallel. The six protected 
regions identified on both strands with A7r5 nuclear extracts were designated smooth muscle 
elements (SME)-l-6, respectively. Two footprinted regions, SME-1 (bp -279 to -256) and 
SME-4 (bp -171 to -136), contain embedded SREs, or CArG boxes (CCWWWWWWGG, 
SEQ ID NO:47), that have been shown previously to bind the MADS box transcription factor, 
SRF, and play an important role in regulating transcription of the genes encoding skeletal and 
cardiac cc-actin (Minty et aL, 1986; Moss et <//., J. Biol. Chem., 269:12731, 1994; Muscat et 
al. 9 Gene Exp. 2:111, 1992). Fine differences in the digestion patterns between nuclear 
extracts prepared from A7r5 and NIH 3T3 cells could be distinguished over the SME-4 site. 
Several studies suggest that nucleotides embedded within and/or flanking CArG boxes 
regulate binding of ternary complex factors (TCFs) ? including members of the ets and 
homeodomain families of transcription factors. Thus, the finding that a PEA3 motif (bp -295 
to -289) ; which has been demonstrated to bind in vitro to ets family members, lies 23-bp 5' of 
the SME-1 motif is noteworthy. Similarly. SME-4 spans a GGAG motif (bp -142 to bp -139) 
which has been demonstrated to bind to TCFs in the ets family of transcription factors 
(Johansen and Prywes, Biochem. Biophys. Acta. 1242:1-10, 1995). Moreover, the SME-4 
motif contains the embedded motif ATATGG (bp -146 to bp -141) which has been 
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demonstrated to bind homeobox transcription factors including Csx/Nkx2.5 (Chen et al, 
1996). 

The SME-2 nuclear protein binding site (bp -249 and bp -216) contains consensus 
binding motifs for the ubiquitously expressed transcription factors, Spl (KRGGCKRRK) and 
AP2 (CCCMNSSS). Fine differences in the digestion patterns between nuclear extracts 
prepared from A7r5 and NIH 3T3 cells could be distinguished over this site. The SME-3 
nuclear protein binding site (bp -215 to bp -186), which is flanked by DNase I hypersensitive 
sites at both its 5' and 3' borders, was protected only by nuclear extracts prepared from A7r5 
and not by extracts prepared from NIH 3T3 cells. This nuclear protein binding site has not 
been described previously. The SME-5 nuclear protein binding site (bp -86 to bp -66) once 
again contains consensus Spl and AP2 motifs. The SME-6 nuclear protein binding site (bp - 
59 to -35), lies immediately 5' of the non-consensus TATA box (TTTAA), and contains 
nucleotide sequences that have been demonstrated previously to bind the cyclic AMP 
response element (CRE) binding proteins (for review see Lalli and Sassone-Corsi, 1994). An 
AT-rich sequence (bp -408 to -415) with 8/10 bp sequence identity with the consensus MEF2 
binding motif (Gossett et al 9 1989) was not protected with either A7r5 or NIH 3T3 nuclear 
extracts. Taken together, these studies demonstrated six nuclear protein binding sites within 
the murine SM22a promoter. Three of these binding sites (SME-2, SME-3 and SME-4) 
demonstrated differential patterns of digestion when incubated with nuclear extracts prepared 
from A7r5 and NIH 3T3 cells. 

Characterization of /raw-acting factors that bind to the SM22ct promoter. 

To assess the number, specificity, and identity, of nuclear proteins that bind to the 
arterial SMC-specific SM22cc promoter, a series of electrophoretic mobility shift assays 
(EMS As) were performed. To determine whether the SME-l/CArG and SME-4/CArG bind 
common, overlapping, or distinct, sets of /ram-acting factors, EMS As were performed using 
radiolabeled SME-1 and SME-4 oligonucleotide probes. The radiolabeled SME-1 
oligonucleotide probe bound three specific nuclear protein complexes, designated A-C, as 
determined by addition of specific and non-specific unlabeled competitor oligonucleotides to 
the binding reactions. Unlabeled SME-4 oligonucleotide competed for binding of complex 
A, but failed to compete for complexes B and C. Unlabeled Spl oligonucleotide competed 
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for binding of complex B (that co-migrated with complex A), as well as, complex C. 
Antibody supershift studies confirmed that complex A contains SRF (or an antigenically 
related protein) and complex B contains Spl (or an antigenically related protein). 

EMSAs performed with the radiolabeled SME-4 oligonucleotide probe demonstrated 
four specific nuclear protein complexes, designated A-D, as determined by addition of 
specific and non-specific competitor oligonucleotides. Addition of unlabeled SME-1 
oligonucleotide competed only for binding of complexes A and B. Antibody supershift 
studies revealed that both of these low-mobility nuclear protein complexes contained a 
protein identical, or antigenically-related, to SRF, while complexes C and D contained a 
protein identical, or antigenically-related, to YY1. Taken together, these data demonstrate 
that, as expected, SRF (or an SRF -containing protein complex) binds to both the SME-1 and 
SME-4 sites. The demonstration of two low mobility SME-4 binding activities containing 
SRF (complexes A and B) suggests that one, or both, of these complexes may contain 
additional /raw-acting factors. In addition, SME-1 bound Spl (complex B) and one 
potentially novel nuclear protein complex (complex C) that does not bind to SME-4. 
Conversely, SME-4 binds the ubiquitously expressed and potentially negative regulatory 
factor, YY1 (Gualberto et ai 9 Mol. Cell, Biol 12:4209, 1992; Lee et al. Proc. Natl. Acad. 
Sci., USA 89:9814, 1992; Lee et ai 9 Oncogene 9:1047, 1994) (complexes C and D), while 
SME-1 does not. 

Both the SME-2 and SME-5 sites are GC-rich motifs that contain potential Spl and 
AP2 motifs. EMSAs performed with nuclear extracts prepared from primary rat aortic SMCs 
and radiolabeled oligonucleotides corresponding to the SME-2 and SME-5 nuclear protein 
binding sites, respectively, revealed identical band-shift patterns suggesting that these two 
motifs might bind a common set of /ram-acting factors. Each probe bound three specific 
nuclear protein complexes, designated A-C, as determined by addition of unlabeled specific 
and non-specific oligonucleotide competitors. Unlabeled SME-2 oligonucleotide competed 
for binding of each nuclear protein complex that bound the radiolabeled SME-5 probe and 
visa versa. Moreover, an oligonucleotide containing a consensus Spl motif competed for 
binding of complexes X-C. Antibody supershift studies revealed that complex A was ablated 
and supershifted by pre-incubation with Spl -specific antiserum, but not by control murine 
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IgG, or a-AP2 antiserum. Each of these nuclear protein complexes were also present in 
nuclear extracts prepared from non-SMC lineages including the lymphoid lines, WEHI and 
70Z/3. These data demonstrate that the SME-2 and SME-5 nuclear protein binding sites each 
bind three ubiquitously expressed nuclear protein complexes, at least one of which contains a 
protein that is identical, or antigenically related, to Spl . 

As discussed above, SME-3 was protected from DNase I digestion by nuclear extracts 
prepared from A7r5 cells, but not by extracts prepared from N1H 3T3 cells, suggesting that 
this previously undescribed motif might bind one or more SMC lineage-specific (rans-aciing 
factors. EMSAs performed with the radiolabeled SME-3 oligonucleotide probe revealed 
three specific binding activities, designated A-C, as determined by addition specific and non- 
specific competitor oligonucleotides. Antibody supershift studies revealed that complex B 
and C contained YY1 (or an antigenically related protein). None of the nuclear protein 
complexes were supershifted by control IgG or a-Spl antiserum. To determine whether any 
of these nuclear protein complexes were expressed in a lineage-restricted fashion, EMSAs 
were performed with the SME-3 probe and nuclear extracts prepared from primary rat aortic 
SMCs, the SMC line, A7r5, C3H10T1/2 and NIH 3T3 fibroblasts, and the mouse T cell line, 
EL4. Interestingly, complex C, which was ablated by pre-incubation with a-YYl antiserum, 
was present only in primary rat aortic SMCs and the SMC line A7r5, but was absent in 
C3H10T1/2, NIH 3T3, and EL4 nuclear extracts. Moreover, three faint complexes were 
identified in C3H10T1/2, NIH 3T3 and EL4 cells, but were not present in SMC extracts. 
Taken together, these data suggest that the SME-3 nuclear protein binding site, a motif which 
has not been described previously, binds YY1 and one or more, as yet, unidentified SMC- 
specific and/or lineage restricted trans-acting factors. In addition, the radiolabeled SME-3 
probe binds three nuclear protein complexes that are present in several non-SMC lines, but 
not in primary vascular SMCs or the SMC line, A7r5. 

EMSAs performed with a radiolabeled oligonucleotide corresponding to the SME-6 
nuclear protein binding site revealed four specific nuclear protein complexes, designated A- 
D, respectively. Each of these complexes were competed with unlabeled SME-6 
oligonucleotide. Moreover, addition of an unlabeled consensus CRE oligonucleotide derived 
from the T cell receptor a enhancer competed exclusively for binding of complexes B and C. 
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Pre-incubation of the binding reactions with a-CREB-1 antiserum ablated and supershifted 
complex B, while complex C was ablated by addition of a- ATF- 1 antiserum. In addition, 
complex A was ablated and supershifted by pre-incubation with a-Spl antiserum. Finally, 
complex D was ablated by the addition of oc-YYl antiserum. In contrast, none of the four 
complexes were ablated or supershifted following pre-incubation with control rabbit or 
murine IgG, or antisera that recognize GATA-4 or SRF. Interestingly, EMSAs performed 
with the radiolabeled SME-6 oligonucleotide probe and nuclear extracts prepared from the 
non-SMC lines, C2C12 myotubes, C3H10T1/2 and NIH 3T3 fibroblasts, and EL4 T cells, 
revealed fine differences in the mobilities of several nuclear protein complexes (and/or novel 
complexes), as well as, increased intensity in each of the SME-6 binding activities. Taken 
together, these data revealed that the SME-6 motif binds CREB-1 and ATF- 1 , each of which 
are expressed in primary vascular SMCs, as well as, the ubiquitously expressed transcription 
factors, Spl and YY1. 

In summary, as shown in FIG. 2, the arterial SMC-specific SM22a promoter contains 
six nuclear protein binding sites, designated smooth muscle element (SME)l-6. respectively. 
SME-l/CArG binds SRF (and ternary complex factors), Spl and one unidentified nuclear 
protein complex that is not cross-competed by SME-4/CArG oligonucleotides. SME-2 binds 
three specific nuclear protein complexes at least one of which contains Spl, each of which 
also binds to the SME-5 site. SME-3, a motif that has not been described previously, binds 
YY1 and two unidentified nuclear protein complexes, one of which includes a potentially 
novel lineage-restricted trans-acting factor. In addition, the SME-3 motif binds several /ram- 
acting factors which are present in nuclear extracts prepared from non-SMCs but which are 
not present in SMC extracts. SME-4/CArG binds nuclear protein complexes containing SRF 
and YY1 -related proteins. Two high mobility complexes were ablated and supershifted by 
pre-incubation with a- SRF antiserum suggesting that one, or both, of these nuclear protein 
complexes may contain accessory factors. Finally, SME-6 binds CREB-1, ATF-1, YY1, and 
Spl. 



WO 98/15575 



PCT7US97/16204 



-56- 

Example 14 
Functional characterization of 
the SM22a promoter 

To characterize the functional significance of each of the m-acting elements within 
the SM22oc promoter, specific mutations that abolish binding of one or more /nmy-acting 
factors to nuclear protein binding sites located within the SM22a promoter were created 
within the context of the p-441SM22Iuc reporter plasmid. The effect of each mutation was 
assessed by transient transfection analysis of each mutant SM22a promoter luciferase 
reporter plasmid into primary rat aortic SMCs. To assess the function of the SME-l/CArG 
and SME-4/CArG sites, each of which bind SRF, mutations were created that abolish SRP 
binding to SME-1, and SRF and YY1 binding to SME-4, respectively. These mutations did 
not affect binding of any other nuclear protein complex (demonstrated by EMS A) to SME-1 
or SME-4. Transfection analyses revealed that mutation of the SME-1 site resulted in a 55% 
reduction in normalized luciferase activity compared to that obtained with the p-441SM221uc 
plasmid. Remarkably, a two nucleotide substitution in the SME-4 site that abolished SRF 
binding activity resulted in a 88% reduction in normalized luciferase activity compared to 
that obtained with the wild type SM22oc promoter. Moreover, the p-441SM22jaCArG 
plasmid, which contains mutations in both SME-1 and -4 that inhibit binding of SRF, 
completely abolished transcriptional activity of the SM22ct promoter in primary rat aortic 
SMCs and the SMC-line A7r5. These data demonstrate that the SME-1 and -4 nuclear 
protein binding sites are required for activity of the SM22a promoter in arterial SMCs in 
vitro. Moreover, these data suggest that SM22a promoter activity is critically-dependent 
upon the SME-4 site, SRF, and/or /ram--acting factors that interact with SRF. 

To assess the functional significance of each of the other (non-CArG containing) 
nuclear protein binding sites in the SM22ot promoter, mutations that abolish binding of one or 
more trans-acling factor to each site were created within the context of the 441 -bp SM22a 
promoter containing plasmid, p-441SM22tuc. Because the SME-2 and SME-5 nuclear 
protein binding sites, each bind a nuclear protein complex containing SpK in addition to two 
other common nuclear protein complexes, mutations were created within the context of the p- 
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441SM22-luc plasmid that abolish binding of each /ra/7^-acting factor to SME-2 r SME-5, and 
both SME-2 and SME-5. Transfection of each of these plasmids and the p-441SM22-luc 
plasmid into primary rat aortic SMCs demonstrated that mutation of the SME-2. SME-5, and 
SME-2 and SME-5, resulted in 58%, 6% and 70% respective reductions in normalized 
luciferase activities. These data suggest that within the context of the SM22a promoter, the 
SME-2, and -5 nuclear protein binding sites are required for full promoter activity, but may 
be functionally redundant. 

Mutation in the SME-3 site which abolishes binding of all three SME-3 binding 
activities (including the potentially novel lineage-restricted /raw-acting factor) resulted in a 
50% reduction in transcriptional activity compared to that observed with the native SM22a 
promoter. These data suggest either that activity of the SM22a promoter in arterial SMCs is 
not critically dependent on this potentially novel lineage-restricted /ram-acting factor, or 
alternatively, that an additional nuclear protein binding site for this lineage-restricted /ram- 
acting factor exists in the 441 -bp SM22a promoter (that was not detected by DNase I 
footprint analyses and EMSAs). To assess the functional significance of the SME-6 nuclear 
protein binding site, and to determine whether the CRE located within SME-6 is required for 
promoter activity, the -441SM22^iCREB/SME-6 plasmid, which abolishes binding 
specifically of each of the CRE-related complexes (but not YY1) was compared to the p- 
441SM221uc reporter plasmid. The single mutation within the CREB motif reduced 
transcriptional activity by approximately 60% . In contrast, mutations within SME-6 that do 
not abolish CRE binding activities did not significantly decrease transcriptional activities. 
These data suggest that CREB family members may play an important functional role in 
transcription of the SM22oc gene in VSMCs. 

The arterial SMC-specific SM22a promoter is CArG-dependent in vivo 

As shown above, mutations of the SME-l/CArG and SME-4/CArG elements that 
inhibited binding of SRF to the SM22ct promoter, totally abolished SM22a promoter activity 
in arterial SMCs in vitro. To determine whether SME-1 and -4 are required for activity of the 
SM22a promoter in arterial SMCs (and the myotomal component of the somites) in vivo, 
transgenic mice were produced containing a transgene, designated -441 SM22uCArG, that 
encodes the bacterial lacZ reporter gene under the transcriptional control of a mutant SM22a 
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promoter containing mutations in both SME-1 and SME-4 that abolish binding of SRF (as 
described above). Thirteen independent -441 SM22jaCArG transgenic lines were produced 
with copy numbers ranging between 1 and 730 copies per cell. In contrast to the - 
441SM22/tfcZ transgenic mice that expressed the lacZ transgene in the arterial SMCs and 
5 within the myotomal component of the somites, in 12 out of 13 independent - 
441SM22|iCArG lines, (3-galactosidase activity could not be detected in either the arterial 
SMCs or within the myotomal component of the somites at EDI 1.5. In one line harboring 
the -441SM22u,CArG transgene (that contained 5 copies per cell), blue staining was detected 
exclusively within the cardiac outflow tract, but not within the SMCs of the dorsal aorta or 

10 branch arteries, the somites, or any other tissue. Given the low frequency at which this 
pattern of lacZ expression was observed, it is likely that it resulted from integration of the 
transgene near a cryptic enhancer element. These data demonstrate that the SME-1 and SME- 
4 nuclear protein binding sites located within arterial SMC-specific SM22a promoter are 
required for SM22a promoter activity in vivo. Moreover, these data suggest strongly that 

15 SRF plays an important role in regulating activity of the SM22oc promoter in vivo. 

While the compositions and methods of this invention have been described in terms of 
preferred embodiments, it will be apparent to those of skill in the art that variations may be 
applied to the composition, methods and in the steps or in the sequence of steps of the method 
described herein without departing from the concept, spirit and scope of the invention. More 

20 specifically, it will be apparent that certain agents which are both chemically and 
physiologically related may be substituted for the agents described herein while the same or 
similar results would be achieved. All such similar substitutes and modifications apparent to 
those skilled in the art are deemed to be within the spirit, scope and concept of the invention 
as defined by the appended claims. 

25 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Arch Development Corporation 

(B) STREET: 1101 East 58th Street 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60637 

(A) NAME: Michael S. Parmacek 

(B) STREET: 1225 E. 56th Street 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60637 

(A) NAME: Julian Solway 

(B) STREET: 746 Grove Street 

(C) CITY: Glencoe 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60022 

(ii) TITLE OF INVENTION: PROMOTER FOR SMOOTH MUSCLE -CELL EXPRESSION 
(iii) NUMBER OF SEQUENCES: 51 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: Unknown 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/726,807 

(B) FILING DATE: 07-OCT-1996 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GAATTCAGGA CGTAATCAGT GGCTGGAAAG CAAGAGCTCT AGAGGAGCTC CAGCTTATTA 6 0 
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TGACCCTTCC TTCAGATGCC ACAAGGAGGT GCTGGAGTTC TATGCACCAA TAGCTTAAAC 12 0 

CAGCCAGGCT GGCTGTAGTG GATTGAGCGT CTGAGGCTGC ACCTCTCTGG CCTGCAGCCA 18 0 

GTTCTGGGTG AGACTGACCC TGCCTGAGGG TTCTCTCCTT CCCTCTCTCT ACTCCTTTCT 24 0 

CCCTCTCCCT CTCCCTCTCT CTGTTTCCTG AGGTTTCCAG GATTGGGGAT GGGACTCAGA 3 00 

GACACCACTA AAGCCTTACC TTTTAAGAAG TTGCATTCAG TGAGTGTGTG AGAC AT AG C A 360 

CAGATAGGGG CAGAGGAGAG CTGGTTCTGT CTCCACTGTG TTTGGTCTTG GGTACTGAAC 4 20 

TCAGACCATC AGGTGTGATA GCAGTTGTCT TTAACCCTAA CCCTGAGCCT GTCTCACCTG 48 0 

TCCCTTCCCA AGACCACTGA AG C TAG GTG C AAGATAAGTG GGGACCCTTT CTGAGGTGGT 54 0 

AGGATCTTTC AC GATAAG G A CTATTTTGAA GGGAGGGAGG G TG AC AC T G T CCTAGTCCTC 6 00 

TTACCCTAGT GTCTCCAGCC TTGCCAGGCC TTAAACATCC G CCCATTGTC ACCGCTCTAG 66 0 

AAGGGGCCAG GGTTGACTTG CTGCTAAACA AGGCACTCCC T AG AGAAG C A CCCGCTAGAA 720 

GCATACCATA CCTGTGGGCA GGATGACCCA TGTTCTGCCA CGCACTTGGT AGCCTTGGAA 780 

AGGCCACTTT GAACCTCAAT TTTCTCAACT GTTAAATGGG GTGGTAACTG CTATCTCATA 84 0 

ATAAAGGG G A ACGTGAAAGG AAGGCGTTTG CATAGTGCCT GGTTGTGCAG CCAGGCTGCA 90 0 

GTCAAGACTA GTTCCCACCA ACTCGATTTT AAAGCCTTGC AAGAAGGTGG CTTGTTTGTC 96 0 

CCTTGCAGGT TCCTTTGTCG GGCCAAACTC TAGAATGCCT CCCCCTTTCT TTCTCATTGA 102 0 

AGAGCAG AC C CAAGTCCGGG TAACAAGGAA GGGTTTCAGG GTCCTGCCCA TAAAAGGTTT 10 8 0 

TTCCCGGCCG CCCTCAGCAC CGCCCCGCCC CGACCCCCGC AGCATCTCCA AAGCATGCAG 114 0 

AGAATGTCTC CGGCTGCCCC CGACAGACTG CTCCAACTTG GTGTCTTTCC CCAAATATGG 12 0 0 

AGCCTGTGTG GAGTGAGTGG GGCGGCCCGG GGTGGTGAGC CAAGCAGACT TCCATGGGCA 12 6 0 

GGGAGGGGCG CCAGCGGACG GCAGAGGGGT GACATCACTG CCTAGGCGGC CTTTAAACCC 13 20 

CTCACCCAGC CGGCGCCCCA GCCCGTCTGC CCCAGCCCAG ACACCGAAGC TACTCTCCTT 138 0 

CCAGTCCACA AACGACCAAG CCTTGTAAGT GCAAGTCAT 1419 
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(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 38. .218 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 322. .500 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION : 866 . .96 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



CTTTTCTCCA CACTCTATAC TTTAGCTCTG CCTCAAC ATG GCC AAC AAG GGT CCA 55 

Met Ala Asn Lys Gly Pro 
1 5 

TCC TAC GGC ATG AGC CGA GAA GTG CAG TCC AAA ATT GAG AAG AAG TAT 103 
Ser Tyr Gly Met Ser Arg Glu Val Gin Ser Lys lie Glu Lys Lys Tyr 
10 15 20 

GAC GAG GAG CTG GAG GAG CGA CTA GTG GAG TGG ATT GTA GTG CAG TGT 151 
Asp Glu Glu Leu Glu Glu Arg Leu Val Glu Trp He Val Val Gin Cys 
25 3 0 35 

GGC CCT GAT GTA GGC CGC CCA GAT CGT GGG CGC CTG GGC TTC CAG GTG 199 
Gly Pro Asp Val Gly Arg Pro Asp Arg Gly Arg Leu Gly Phe Gin Val 
40 45 50 

TGG CTG AAG AAT GGT GTG G TGAGTAACCC TTGCGAAGGG AATCTAGGGA 24 8 

Trp Leu Lys Asn Gly Val 
55 60 



TGTGTATGCC GCCCTACAAA CTG TGAG AC A GACTCCCTGA GCTGAGTGTT CAGTTGTGTT 3 08 

CTGTACCTGG CAG ATT CTG AGC AAA TTG GTG AAC AGC CTG TAT CCT GAG 3 57 

He Leu Ser Lys Leu Val Asn Ser Leu Tyr Pro Glu 
I 5 10 



GGA TCG AAG CCA GTG AAG GTG CCT GAG AAC CCA CCC TCC ATG GTC TTT 
Gly Ser Lys Pro Val Lys Val Pro Glu Asn Pro Pro Ser Met Val Phe 
15 20 25 



405 
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AAG CAG ATG GAA CAG GTG GCT CAA TTC TTG AAG GCA GCT GAA GAT TAT 4 53 

Lys Gin Met Glu Gin Val Ala Gin Phe Leu Lys Ala Ala Glu Asp Tyr 
30 35 40 

GGA GTC ATC AAG ACT GAC ATG TTC CAG ACT GTT GAC CTC TAT GAA GG 500 
Gly Val lie Lys Thr Asp Met Phe Gin Thr Val Asp Leu Tyr Glu 
45 50 55 

TATAAGGAAA AAAGGGCTGG AGCCAGTGGG CGAGTGGAGA G C AAG ATT AT CAGTCAAGGA 560 

GAAGGAATAT CAAAAGCCAC AACCAGCTCT GTTGATGTGT TCATAGCAGG AATGGGATAT 620 

G C C AAG AG AA C AC AT AG CAA GGGGACCAGC TTGGTGGTAC AGCATTTCCT TCTGGGTACA 680 

AGGGCCTGTT TTGGATCCTA GAATATCAAA TATATACCAC AC C AT ACT C A C TAG GG TT T A 74 0 

GAATATGGTC TCTTGAACCC TCTTGATTTG GTGCCACTTG CTCCTTGGTT GG AC C ATTTT 800 

TGAAGCTGGG CAGGTATTGC CTATATGGTC CTGAAATTAG CTCCCTGGCC ACTCTTCTCA 8 60 

TAGGT AAG GAT ATG GCA GCA GTG CAG AGG ACT CTA ATG GCT TTG GGC 907 
Lys Asp Met Ala Ala Val Gin Arg Thr Leu Met Ala Leu Gly 
15 10 

AGT TTG GCT GTG ACC AAA AAC GAT GGA AAC TAC CGT GGA GAT CCC AAC 9 55 

Ser Leu Ala Val Thr Lys Asn Asp Gly Asn Tyr Arg Gly Asp Pro Asn 
15 20 25 30 

TGG TTT ATG AAG TATGTGTCCA CTGGGTCTCT CTGT 9 91 

Trp Phe Met Lys 



(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Asn Lys Gly Pro Ser Tyr Gly Met Ser Arg Glu Val Gin Ser 
15 10 15 

Lys He Glu Lys Lys Tyr Asp Glu Glu Leu Glu Glu Arg Leu Val Glu 
20 25 30 

Trp He Val Val Gin Cys Gly Pro Asp Val Gly Arg Pro Asp Arg Gly 
35 40 45 



Arg Leu Gly Phe Gin Val Trp Leu Lys Asn Gly Val 
50 55 60 
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(2) INFORMATION FOR SEQ ID NO: 4: • 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

lie Leu Ser Lys Leu Val Asn Ser Leu Tyr Pro Glu Gly Ser Lys Pro 
15 10 15 

Val Lys Val Pro Glu Asn Pro Pro Ser Met Val Phe Lys Gin Met Glu 
20 25 30 

Gin Val Ala Gin Phe Leu Lys Ala Ala Glu Asp Tyr Gly Val lie Lys 
35 40 45 

Thr Asp Met Phe Gin Thr Val Asp Leu Tyr Glu 
50 55 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

Lys Asp Met Ala Ala Val Gin Arg Thr Leu Met Ala Leu Gly Ser Leu 
15 10 15 

Ala Val Thr Lys Asn Asp Gly Asn Tyr Arg Gly Asp Pro Asn Trp Phe 
20 25 30 

Met Lys 



(2) INFORMATION FOR SEQ ID NO: 6: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 8. .16 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACTTACCCTG GTTCCTTTTC TTCTAGG AAA GCC CAG GAG CAT AAG AGG GAC 51 

Lys Ala Gin Glu His Lys Arg Asp 
1 5 

TTC ACA GAC AGC CAA CTG CAG GAG GGG AAG CAC GTC ATT GGC CTT CAA 9 9 

Phe Thr Asp Ser Gin Leu Gin Glu Gly Lys His Val lie Gly Leu Gin 
10 15 20 

ATG GGC AGC AAC AGA GGA GCC TCG CAG GCT GGC ATG ACA GGC TAT GGG 14 7 

Met Gly Ser Asn Arg Gly Ala Ser Gin Ala Gly Met Thr Gly Tyr Gly 
25 30 35 40 

CGA CCC CGG CAG ATC ATC AGT T AGAAAGGGAA GGCCAGCCCT GAGCTGCAGC 19 9 
Arg Pro Arg Gin lie lie Ser 
45 

ATCCTGCTTA GCCTGCCTCA CAAATGCCTA TGTAGGTTCT TAG CC CTG AC AGCTCTGAGG 25 9 

TGTCACTGGG CAAAGATGAC TGCACATGGG CAGCTCCCAC CT ATC CTT AG CCTCAGCCCA 319 

GCATCTTACC CCAGAGCCAC CACTGCCCTG GCCCCTGTTC CCAGCTGTAC CCCCACCTCT 379 

ACTGTTCCTC TCATCCTGGA GTAAGCAGGG AGAAGTGGGC TGGGGTAGCT GGCTGTAGGC 439 

CAGCCCACTG T CCTTG AT AT CGAATGTCCT TTGAAGGAGA CCCAGCCCAG CCTCTACATC 4 99 

TTTTCCTGGA ATATGTTTTT GGGTTGAAAT TCAAAAAGGA AAAAAGAAAA ATATATAAAT 55 9 

ATATATATAT ATATAC 57 5 



(2) INFORMATION FOR SEQ ID NO : 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Lys Ala Gin Glu His Lys Arg Asp Phe Thr Asp Ser Gin Leu Gin Glu 
15 10 15 



Gly Lys His Val lie Gly Leu Gin Met Gly Ser Asn Arg Gly Ala Ser 
20 25 30 
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Gin Ala Gly Met Thr Gly Tyr Gly Arg Pro Arg Gin lie He Ser 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION:77 . .681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

GCCCGTCTGC CCCAGCCCAG AC AC CG AAG C TACTCTCCTT CCAGTCCACA AACGACCAAG 60 

CCTTCTCTGC CTCAAC ATG GCC AAC AAG GGT CCA TCC TAC GGC ATG AGC 109 
Met Ala Asn Lys Gly Pro Ser Tyr Gly Met Ser 
1 5 10 

CGA GAA GTG CAG TCC AAA ATT GAG AAG AAG TAT GAC GAG GAG CTG GAG 157 
Arg Glu Val Gin Ser Lys He Glu Lys Lys Tyr Asp Glu Glu Leu Glu 
15 20 25 

GAG CGA CTA GTG GAG TGG ATT GTA GTG CAG TGT GGC CCT GAT GTA GGC 2 05 

Glu Arg Leu Val Glu Trp lie Val Val Gin Cys Gly Pro Asp Val Gly 
30 35 40 

CGC CCA GAT CGT GGG CGC CTG GGC TTC CAG GTG TGG CTG AAG AAT GGT 2 53 

Arg Pro Asp Arg Gly Arg Leu Gly Phe Gin Val Trp Leu Lys Asn Gly 
45 50 55 

GTG ATT CTG AGC AAA TTG GTG AAC AGC CTG TAT CCT GAG GGA TCG AAG 3 01 

Val lie Leu Ser Lys Leu Val Asn Ser Leu Tyr Pro Glu Gly Ser Lys 
60 65 70 75 

CCA GTG AAG GTG CCT GAG AAC CCA CCC TCC ATG GTC TTT AAG CAG ATG 34 9 

Pro Val Lys Val Pro Glu Asn Pro Pro Ser Met Val Phe Lys Gin Met 
80 85 90 

GAA CAG GTG GCT CAA TTC TTG AAG GCA GCT GAA GAT TAT GGA GTC ATC 3 97 

Glu Gin Val Ala Gin Phe Leu Lys Ala Ala Glu Asp Tyr Gly Val He 
95 100 105 

AAG ACT GAC ATG TTC CAG ACT GTT GAC CTC TAT GAA GGT AAG GAT ATG 44 5 

Lys Thr Asp Met Phe Gin Thr Val Asp Leu Tyr Glu Gly Lys Asp Met 
HO 115 120 
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GCA GCA GTG CAG AGG ACT CTA ATG GCT TTG GGC AGT TTG GCT GTG ACC 4 93 

Ala Ala Val Gin Arg Thr Leu Met Ala Leu Gly Ser Leu Ala Val Thr 
125 130 135 

AAA AAC GAT GGA AAC TAC CGT GGA GAT CCC AAC TGG TTT ATG AAG AAA 541 
Lys Asn Asp Gly Asn Tyr Arg Gly Asp Pro Asn Trp Phe Met Lys Lys 
140 145 150 155 

GCC CAG GAG CAT AAG AGG GAC TTC ACA GAC AGC CAA CTG CAG GAG GGG 58 9 

Ala Gin Glu His Lys Arg Asp Phe Thr Asp Ser Gin Leu Gin Glu Gly 
160 165 170 

AAG CAC GTC ATT GGC CTT CAA ATG GGC AGC AAC AGA GGA GCC TCG CAG 63 7 

Lys His Val lie Gly Leu Gin Met Gly Ser Asn Arg Gly Ala Ser Gin 
175 180 185 

GCT GGC ATG ACA GGC TAT GGG CGA CCC CGG CAG ATC ATC AGT TA 681 
Ala Gly Met Thr Gly Tyr Gly Arg Pro Arg Gin lie He Ser 
190 195 200 

GAAAGGGAAG GCCAGCCCTG AGCTGCAGCA TCCTGCTTAG CCTGCCTCAC AAATGCCTAT 741 

GTAGGTTCTT AGCCCTGACA GCTCTGAGGT GTCACTGGGC AAAGATGACT G CAC ATG GG C 801 

AGCTCCCACC T ATC CTT AGC CTCAGCCCAG CATCTTACCC CAGAGCCACC ACTGCCCTGG 861 

CCCCTGTTCC CAGCTGTACC CCCACCTCTA CTGTTCCTCT CATCCTGGAG TAAGCAGGGA 921 

GAAGTGGGCT GGGGTAGCTG GCTGTAGGCC AGCCCACTGT C C TTG AT AT C GAATGTCCTT 981 

TGAAGGAGAC CCAGCCCAGC CTCTACATCT TTTCCTGGAA TATGTTTTTG GG TTG AAATT 1041 

CAAAAAGGAA AAAAGAAAAA TATATAAATA TATATATATA CAAAAAAAAA AAAAAAAAAA 1101 

A 1102 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

Met Ala Asn Lys Gly Pro Ser Tyr Gly Met Ser Arg Glu Val Gin Ser 
15 10 15 



Lys He Glu Lys Lys Tyr Asp Glu Glu Leu Glu Glu Arg Leu Val Glu 
20 25 30 
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Trp lie val Val Gin Cys Gly. Pro Asp Val Gly Arg Pro Asp Arg Gly 
3 5 4 0 4 5 

Arg Leu Gly Phe Gin Val Trp Leu Lys Asn Gly Val lie Leu Ser Lys 
50 55 60 

Leu Val Asn Ser Leu Tyr Pro Glu Gly Ser Lys Pro Val Lys Val Pro 
65 70 75 80 

Glu Asn Pro Pro Ser Met Val Phe Lys Gin Met Glu Gin Val Ala Gin 
85 90 95 

Phe Leu Lys Ala Ala Glu Asp Tyr Gly Val lie Lys Thr Asp Met Phe 
100 105 110 

Gin Thr Val Asp Leu Tyr Glu Gly Lys Asp Met Ala Ala Val Gin Arg 
115 120 125 

Thr Leu Met Ala Leu Gly Ser Leu Ala Val Thr Lys Asn Asp Gly Asn 
130 135 140 

Tyr Arg Gly Asp Pro Asn Trp Phe Met Lys Lys Ala Gin Glu His Lys 
145 150 155 160 

Arg Asp Phe Thr Asp Ser Gin Leu Gin Glu Gly Lys His Val lie Gly 
165 170 175 

Leu Gin Met Gly Ser Asn Arg Gly Ala Ser Gin Ala Gly Met Thr Gly 
180 185 190 

Tyr Gly Arg Pro Arg Gin lie lie Ser 
195 200 



(2> INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATCGAATTCC GCTACTCTCC TTCCAGCCCA CAAACGACCA AGC 



43 
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(2) INFORMATION FOR SEQ ID NO : 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATCAAGCTTG GTGGGAGCTG CCCATGTGCA GTC 3 3 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGCCGTAGGA TGGACCCTTG TTGGC 2 5 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : mod if ied_base 

(B) LOCATIONS 

(D) OTHER INFORMATION: /mod_base= OTHER 
/note= "Y = C or T/U" 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATIONS 

(D) OTHER INFORMATION: /mod_base= OTHER 
/note= "W = A or T/U" 

(ix) FEATURE: 

(A) NAME /KEY : modi f ied_base 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /mod_base= OTHER 
/note= "R = A or G" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



YTAWAAATAR 



10 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTTAAAATCG 10 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTCAAAATAG 10 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
Cys Cys Cys Met Asn Ser Ser Ser 



(2) INFORMATION FOR SEQ ID NO : 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
•(D) TOPOLOGY: linear 



1 



5 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Lys Arg Gly Gly Cys Lys Arg Arg Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Thr Lys Asn Asn Gly Asn Ala Ala Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met He Arg He Cys Arg Lys Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AGTCAAGACT AGTTCCCACC AACTCGATTT TAAAGCCTTG CAAGAAGGTG GCTTGTTTGT 6 0 

CCCTTGCAGG TTCCTTTGTC GGGCCAAACT CTAGAATGCC TCCCCCTTTC TTTCTCATTG 12 0 

AAGAGCAGAC CCAAGTCCGG GTAACAAGGA AGGGTTTCAG GGTCCTGCCC ATAAAAGGTT 18 0 

TTTCCCGGCC GCCCTCAGCA CCGCCCCGCC CCGACCCCCG CAGCATCTCC AAAGCATGCA 24 0 

GAGAATGTCT CCGGCTGCCC CCGACAGACT GCTCCAACTT GGTGTCTTTC CCCAAATATG 3 00 
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GAGCCTGTGT GGAGTGAGTG GGGCGGCCCG GGGTGGTGAG CCAAGCAGAC TTCCATGGGC 360 
AGGGAGGGGC GCCAGCGGAC G 381 



(2) INFORMATION FOR SEQ ID NO : 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AAGGAAGGGT TTCAGGGTCC TGCCCATAAA AGGTTTTTCC CGGCCGC 4 7 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 22: 
AAGGAAGGGT TTCAGGGTCC TG C C CAT AG A TCTTTTTTCC CGGCCGC 4 7 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CCGCCCTCAG CACCGCCCCG CCCCGAGGCC CGCAGCATGT CCG 4 3 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCGCCCTCAG CACCGCGGAT CCCCGACCCC CGCAGCATCT CCG 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 25: 
CTCCAAAGCA TGCAGAGAAT GTCTCCGGCT GCCCCCG 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTCGGATCCA TGCTAGCAAT GAATTCGGCT GCCCCCG 



(2) INFORMATION FOR SEQ ID NO : 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 27: 
TCCAACTTGG TGTCTTTCCC CAAATATGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid — 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TCCAACTTGG TGTCTTTCCC CAAGGATCCA GCCTGTGTGG AGTG 4 4 



(2) I N FORMAT I ON FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 29: 
TCCAACTTGG TGTCTTTCCC CGGATATGGA GCCTGTGTGG AGTG 4 4 



(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TCCAACTTGG TGTCTTTCCC C AAATT AG G A GCCTGTGTGG AGTG 4 4 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 
G GG C AG GG AG GGGCGCCAGC G 21 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



WO 98/15575 



PCT/US97/16204 



-74- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GGGCAGGTAC CGAATTCAGC G 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GGACGGCAGA GGGGTGACAT CACTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGACGGCAGA GGGGATCCAT GCCTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO : 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGACGGCAGA GGGGATCCAT CACTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO : 36: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CTGGCTAAAG GGGCGGGGCT TGGCCAGCC 



{2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CTCCCATTTC CATGACGTCA TGGTTA 



(2) INFORMATION FOR SEQ ID NO : 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AAGGAAGGGT TTCAGGGTCC TGCCCATAGA TCTTTTTTCC CGGCCGC 



(2) INFORMATION FOR SEQ ID NO : 39: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CCGCCCTCAG CACCGCGGAT CCCCGACCCC CGCAGCATCT CCG 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTCGGATCCA TGCTAGCAAT GAATTCGGCT GCCCCCG 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 41: 
TCCAACTTGG TGTCTTTCCC CAAGGATCCA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TCCAACTTGG TGTCTTTCCC CGGATATGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TCCAACTTGG TGTCTTTCCC CAAATTAGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



GGGCAGGTAC CGAATTCAGC G 



21 



(2) INFORMATION FOR SEQ ID NO : 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 45: 
GGACGGCAGA GGGGATCCAT GCCTGCCTAG GCGGCCG 3 7 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GGACGGCAGA GGGGATCCAT CACTGCCTAG GCGGCCG 3 7 



(2) INFORMATION FOR SEQ ID NO : 47: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATIONS. .8 

(D) OTHER INFORMATION: /mod_base= OTHER 



/note= 



"W = A or T" 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



CCWWWWWWCC 



10 
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(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CTCCAACTTG GTGTCTTTCC CCGGATATGG AGCCTGTGTG GAGTG 4 5 



(2) INFORMATION FOR SEQ ID NO : 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 49: 
CTCCAACTTG GTGTCTTTCC CCAAATTAGG AGCCTGTGTG GAGTG 4 5 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCAAATATGG 10 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
C C AT AT ATGG 



10 
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CLAIMS 

1. An isolated nucleic acid segment comprising an SM22a promoter, wherein said 
promoter is a segment of about 5,000 bases immediately upstream of the transcriptional start 

5 site of the murine SM22a genome and wherein said promoter is operatively linked to a 
heterologous nucleic acid sequence. 

2. An isolated nucleic acid segment of claim 1 , further defined as comprising a nucleic acid 
segment having a sequence according to bases 899-1382 of SEQ ID NO:l, or being 

10 hybridizable to the complement of bases 899-1382 of SEQ ID NO:l under high stringency 
conditions, and effective to promote transcription of a heterologous gene in a smooth muscle 
cell. 

3. The isolated nucleic acid segment of claim 1 , wherein said promoter sequence is further 
1 5 defined as comprising a contiguous sequence of bases 899- 1 3 82 of SEQ ID NO: 1 . 

4. The isolated nucleic acid segment of claim 1 , wherein said promoter sequence is further 
defined as comprising a contiguous sequence of bases 1-1382 of SEQ ID NO:l. 

20 5. The isolated nucleic acid segment of claim 1 , wherein said promoter sequence is further 
defined as comprising a contiguous sequence of bases 1060-1 382 of SEQ ID NO:l. 

6. The nucleic acid segment of claim 1, wherein said heterologous nucleic acid sequence 
encodes a cell cycle control gene, an angiogenesis gene or a cytotoxic gene. 

25 

7. The nucleic acid segment of claim 6, wherein said cell cycle control gene is selected 
from the group consisting of Rb, a phosphorylationdeficient Rb gene. p53, p21, pl6, p27, a ceil 
cycle dependent kinase inhibitor, E2F inhibitor, a CDK kinase or a cyclin gene. 
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8. The nucleic acid segment of claim 6, wherein said cell cycle control gene is a 
phosphorylation deficient Rb gene, p53, p21 or p!6. 

9. The nucleic acid segment of claim 6, wherein said angiogenesis gene is VEGF, iNOS. 
eNOS, basic FGF or FGF-5. 

1 0. The nucleic acid segment of claim 6, wherein said angiogenesis gene is VEGF, iNOS or 
eNOS. 

1 1 . The nucleic acid segment of claim 6, wherein said cytotoxic gene is a herpes simplex 
thymidine kinase gene. 

12. The nucleic acid segment of claim 6, wherein said heterologous nucleic acid sequence 
encodes an antisense RNA effective to inhibit expression of a cell cycle control gene. 

13. A recombinant vector comprising the isolated nucleic acid segment of claim 1. 

14. The recombinant vector of claim 13, further defined as a plasmid. 

15. The recombinant vector of claim 13, further defined as a viral vector. 

1 6. The recombinant vector of claim 1 5, wherein said viral vector is a bacteriophage vector, 
a raus sarcoma virus vector, a p21 virus vector an adeno-associated virus vector or an adenoviral 
vector. 

17. The recombinant vector of claim 16, wherein said vector is a replication defective 
adenovirus vector. 

18. The recombinant vector of claim 13, dispersed in a pharmaceutically acceptable 
solution. 
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19. A host cell wherein said cell contains the nucleic acid segment of claim 1. 

20. The host cell of claim 19, wherein said nucleic acid segment is contained in a vector. 

21 . The host cell of claim 1 9, wherein said host cell is a smooth muscle cell. 

22. The host cell of claim 2 1 , wherein said cell is an A7r5 cell. 

23. A replication deficient adenoviral vector, wherein said vector comprises a smooth 
muscle cell specific transcriptional regulatory segment. 

24. The vector of claim 23, wherein said smooth muscle cell specific transcriptional 
regulatory segment is an SM22a promoter, a smooth muscle calponin promoter, a smooth 
muscle myosin heavy chain promoter, a smooth muscle alpha actin promoter, a smooth muscle 
alpha actin enhancer, a telokin promoter, a smooth muscle gamma-actin promoter or a smooth 
muscle gamma-actin enhancer. 

25. The vector of claim 23, wherein said vector comprises an SM22a promoter segment 
operatively linked to a heterologous gene. 

26. The vector of claim 25, wherein said heterologous gene encodes a a cell cycle control 
gene, an angiogenesisgene or a cytotoxic gene. 

27. The vector of claim 26, wherein said cell cycle control gene is selected from the group 
consisting of Rb, a phosphorylation deficient Rb gene, p53, p2K pl6, p27, a cell cycle 
dependent kinase inhibitor. E2F inhibitor, a CDK kinase or a cyclin gene. 

28. The vector of claim 26. wherein said cell cycle control gene is a phosphorylation 
deficient Rb gene,p53 ? p21 or pi 6. 
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29. The vector of claim 26, wherein said angiogenesis gene is VEGF, iNOS, eNOS, basic 
FGF or FGF-5. 

30. The vector of claim 26, wherein said angiogenesis gene is VEGF, iNOS or eNOS. 

31. The vector of claim 26, wherein said cytotoxic gene is a herpes simplex thymidine 
kinase gene. 

32. The vector of claim 26, wherein said heterologous nucleic acid sequence encodes an 
antisense RNA effective to inhibit expression of a cell cycle control gene. 

33. The vector of claim 23, wherein said vector is dispersed in a pharmacolgically 
acceptable solution. 

34. A method of expressing a heterologous gene in a smooth muscle cell comprising the 
steps of: 

(a) obtaining a nucleic acid segment comprising a murine SM22a promoter region 
operatively linked to a heterologous gene, wherein said nucleic acid is contained 
in an adenoviral vector; 

(b) infecting said smooth muscle cell with said adenoviral vector; and 

(c) culturing said smooth muscle cell under conditions effective to express said 
gene. 

35. The method of claim 34, wherein said SM22a promoter comprises bases 899-1382 of 
SEQIDNO:!. 

36. The method of claim 34, wherein said heterologous gene is a reporter gene. 



37. 



The method of claim 34 r wherein said gene is a cell cycle control regulatory gene. 
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38. The method of claim 34, wherein said adenoviral vector is a replication deficient 
adenoviral vector. 

39. The method of claim 38, wherein said cell is in an animal and said vector is 
administered to said animal in a pharmacologically acceptable solution. 

40. A method of inhibiting smooth muscle cell proliferation comprising the steps of: 

(a) obtaining an isolated nucleic acid segment comprising a cell cycle regulatory 
gene operatively linked to an SM22a promoter region; 

(b) transferring said nucleic acid segment into a smooth muscle cell; and 

(c) maintaining said smooth muscle cell under conditions effective to express said 
cell cycle regulatory gene; 

wherein expression of said cell cycle regulatory gene inhibits proliferation of said smooth 
muscle cell. 

4 1 . The method of claim 40, wherein said smooth muscle cell is in an animal. 

42. The method of claim 40, wherein said cell cycle regulatory gene operatively linked to an 
SM22a promoter region comprises a viral or plasmid vector. 

43. The method of claim 42, wherein said viral vector is an adenoviral vector. 

44. The method of claim 40, wherein said cell cycle regulatory gene is selected from the 
group consisting of Rb, a phosphorylation deficient Rb gene, p53, p21, pi 6, p27 ; a cell cycle 
dependent kinase inhibitor, E2F inhibitor, a CDK kinase or a cyclin gene. 

45. A method of preventing restenosis in a subject following balloon angioplasty, 
comprising the steps of: 
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(a) obtaining an adenoviral vector comprising a cell cycle regulatory gene 
operatively linked to an SM22a promoter region dispersed in a pharmaceutically 
acceptable solution; and 

(b) administering said solution to said subject. 

46. The method of claim 45, wherein said cell cycle regulatory gene encodes a 
constitutively active Rb gene product. 

47. A method of promoting angiogenesis in a subject comprising the steps: 

(a) obtaining a nucleic acid segment comprising an angiogenesis factor gene 
operatively linked to an SM22a promoter region; and 

(b) transferring said nucleic acid segment into a smooth muscle cell to obtain a 
transfec ted cell; 

wherein expression of said nucleic acid segment in said smooth muscle cell promotes 
angiogenesis. 

48. The method of claim 47, wherein said smooth muscle cell is a coronary arterial or 
venous smooth muscle cell. 

49. The method of claim 47, wherein said smooth muscle cell is a peripheral arterial or 
venous smooth muscle cell. 

50. The method of claim 47, wherein said angiogenesis factor is VEGF. 

5 1 . The method of claim 47, wherein said nucleic acid segment comprising an angiogenesis 
factor gene operatively linked to an SM22a promoter region is contained in a viral or plasmid 
vector and said vector is administered to said subject. 



52. The method of claim 38, wherein said transferring is done ex vivo and the method 
further comprises the steps: 
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(a) seeding a bioprosthetic graft or stent with said transfected cells to obtain a 
seeded graft or stent; and 

(b) placing the seeded graft or stent into a coronary or peripheral artery or vein of a 
subject. 

5 

53. A method of inhibiting smooth muscle proliferation comprising the steps of: 

(a) obtaining a nucleic acid segment comprising a cell cycle regulatory gene 
operatively linked to an SM22ct promoter region; 

(b) transferring said nucleic acid segment into a primary smooth muscle cell ex vivo 
1 0 to obtain a transfected cell; 

(c) seeding a bioprosthetic graft or stent with said transfected cell to obtain a seeded 
graft or stent; and 

(d) placing the seeded graft or stent into a coronary or peripheral artery or vein of a 
subject; 

15 wherein expression of said cell cycle regulatory gene inhibits proliferation of a smooth muscle 
cell. 
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